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Preface 


What  contributions  can  current  research  in  cognitive  psychology  make  to  the 
solution  of  problems  in  instructional  design?  This  volume  presents  responses  to 
this  question  from  some  of  the  best  workers  in  an  emerging  field  that  1 have 
labeled  “Cognition  and  Instruction”:  people  concerned  with  the  investigation  of 
the  cognitive  processes  involved  in  instructional  situations.  The  focus  of  this 
volume  was  presaged  by  comments  made  in  a previous  volume  on  cognitive 
psychology  (Forehand,  1974): 

In  what  seems  remarkably  few  years,  information-processing  psychology  has  come  to 
dominate  the  experimental  study  of  complex  human  behavior.  That  rapid  success 
encourages  me  to  speculate  that  within  a comparably  short  time  the  approach  will  have 
as  much  of  an  impact  on  psychology  in  the  field  as  it  has  had  on  psychology  in  the 
laboratory.  In  particular,  its  potential  for  illuminating  recalcitrant  problems  in  education 
seems  evident  [p.  159], 

The  chapters  in  this  volume  indicate  the  extent  to  which  this  potential  has 
already  begun  to  be  realized. 

The  book  is  divided  into  four  parts.  The  fust  three  parts  include  sets  of  re- 
search contributions  followed  by  discussions,  and  the  fourth  part  contains 
three  chapters  that  offer  critiques,  syntheses,  and  evaluations  of  various  aspects 
of  the  preceeding  papers. 

The  chapters  in  Part  I represent  different  strategies  for  instructional  research. 
In  the  first  chapter,  Carroll,  raising  some  of  the  Issues  facing  psycholinguists 
theory,  asks  whether  we  yet  know  enough  to  intentionally  teach  language 
skills  according  to  a systematic  instructional  theory.  He  summarizes  three 
lines  of  theoretical  development-naive,  behavioral,  and  cognitive-that  bear 
upon  the  issue,  and  finally  suggests  that  an  information-processing  view  of 
the  cognitive  processes  underlying  language  behavior  may  ultimately  provide  the 
basis  for  a theory  of  language  instruction.  In  Chapter  2,  Calfee  presents  a 
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research  strategy  that  focuses  upon  the  interpretation  of  the  empirical  results 
obtained  in  both  the  laboratory  and  in  instructional  settings.  He  points  out 
the  potential  pitfalls  awaiting  the  instructional  evaluator  who  has  not  care- 
fully  considered  all  possible  sources  of  interdependency  in  the  cognitive  models. 
The  statistical  analyses  proposed  by  Calfee  may  be  useful  to  those  faced  with  the 
task  of  identifying  the  extent  and  the  pattern  of  the  effects  of  instruction. 

Resnick  focuses  upon  the  area  of  early  mathematics  instruction,  and  she 
reviews  and  evaluates  the  precursors  of  current  procedures  in  task  analysis.  Her 
contribution  traces  the  development  of  a strategy  in  instructional  research  that 
utilizes  information-processing  models  of  cognition  to  meet  the  practical 
demands  of  creating  effective  instructional  procedures. 

Atkinson  provides  a glimpse  of  the  latest  products  of  his  extensive  research 
program,  which  is  aimed  at  developing  what  he  calls  “adaptive  instructional 
systems.”  His  research  stra'-  gy  is  based  upon  the  view  that  “an  all-inclusive 
theory  of  learning  is  not  a prerequisite  for  the  development  of  optimal 

procedures.”  ... 

Part  I concludes  with  discussions  by  Gregg  and  Olson,  and  their  commen  s 
further  emphasise  the  variety  of  strategic  approaches  to  research  on  instruction. 
Gregg  argues  for  the  importance  of  understanding  and  representing  the  learner  s 
strategies  in  instructional  situations,  whereas  Olson  raises  the  issue  of  the 
ultimate  social  utility  of  what  we  decide  to  teach  to  children. 

The  chapters  in  Part  II  focus  upon  process  and  structure  in  learning.  I he 
emphasis  is  upon  the  precise,  explicit,  and  detailed  representation  of  what  is 
learned,  how  it  is  utilized,  and  how  it  is  modified.  In  Chapter  7,  Greeno 
demonstrates  what  such  an  extensive  representation  might  look  like.  He  provi  es 
an  elaborate  statement  of  the  cognitive  objectives  for  three  different  areas, 
elementary  arithmetic,  high-school  geometry,  and  college-level  psychophysics. 
Knowledge  in  each  area  is  represented  by  a different  collection  of  building 
blocks  taken  from  current  information-processing  theories. 

One  of  the  central  issues  in  instructional  research  is  how  new  knowledge  is 
acquired.  Hyman,  in  Chapter  8,  describes  a paradigm  for  exploring  the  ways  in 
which  memory  is  restructured  when  new  information  is  discrepant  ,rom  pre- 
existent stereotypes.  Hyman  uses  a paradigm  borrowed  from  social  psychology 
studies  of  impression  formation,  and  shows  that  it  has  implications  for  the  more 

general  issue  of  information  acquisition. 

In  Chapter  9,  Norman,  Gentner,  and  Stevens  utilize  tools-some  of  them 
already  described  by  Greeno-*o  define  the  general  notion  of  “schema.  Ihe 
analysis  by  Norman,  Gentner,  a d Stevens  is  extremely  fine  grained,  they 
develop  detailed  representations  for  an  increasingly  rich  understanding  of  such 
basic  concepts  as  “give”  and  “buy.”  They  argue  that  such  representations  make 
it  possible  to  be  quite  precise  about  how  instruction  should  proceed. 

Shaw  and  Wilson,  in  Chapter  10,  address  the  issues  of  process  and  structure 
from  a more  abstract-almost  philosophical-position,  but  they  also  provide 
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concrete  examples  from  Shaw’s  work  on  perception.  The  central  issues  concern 
the  ability  to  understand  an  t ntire  concept  from  experience  with  just  a subset  of 
its  instances.  Such  an  abili  >,  Shaw  and  Wilson  argue,  lies  at  the  heart  of  an 
understanding  of  invariance. 

The  discu-Jors  by  Fan  ham-Diggory  (Chapter  1 1)  and  by  Hayes  (Chapter  12) 
offer  stimulating  critiques  of  the  positions  presented  in  Part  II.  Citing  an 
alarmingly  modem  inst  uctional  program  devised  over  half  a century  ago,  Fam- 
ham-Diggory  asks  first  "What’s  new?’’  and  then  “Is  it  better?"  Hayes  suggests 
some  wavs  that  one  can  begin  to  train  students  directly  in  cognitive  skills.  He 
focuses  upon  a recurring  theme  in  the  chapters  of  Part  II:  “What  does  the 
student  know  about  his  own  cognitive  processes9” 

An  essential  but  neglected  element  in  instructional  research  is  the  role  of 
instructions  per  se,  and  the  contributions  to  Part  III  focus  upon  the  processes 
that  underlie  the  comprehension  of  verbal  instructions.  Just  and  Carpenter  take 
the  sentence  as  their  unit  of  analysis.  Using  a sophisticated  and  explicit  model  of 
sentence  processing,  they  are  able  to  account  for  an  impressive  variety  of 
empirical  u ults.  Then  they  suggest  ways  in  which  larger  units,  such  as  those 
used  in  reading  comprehension  tests,  could  be  analyzed  similarly.  Simon  and 
Hayes  take  a larger  unit  of  analysis-the  entire  instruction  set.  They  report  on 
the  development  of  an  information-processing  model  aimed  at  explaining  the 
processes  that  underly  the  understanding  of  instructions  for  complex  puzzles. 
Then,  using  the  unambiguous  components  of  their  model  as  points  of  reference, 
they  sketch  the  broader  implications  that  a theory  of  understanding  could  have 
for  instructional  research  and  practice. 

In  the  discussions  in  Part  III,  Collins  (Chapter  15)  and  Shaw  (Chapter  It') 
suggest  areas  for  extension  of  the  models  of  comprehension  described  earlier. 
Collins  asks  about  the  nature  of  the  comparison  process-a  basic  unitary  process 
in  the  Just  and  Carpenter  model-and  speculates  that  it  might  itself  be  composed 
of  even  more  elementary  subprocesses.  Another  isiue  raised  by  Collins  is  the  role 
played  by  the  broader  knowledge  base  in  which  the  comprehension  processes  for 
sentences  or  task  instructions  operate. 

Shaw’s  comments  range  somewhat  farther  afield,  touching  on  the  papers  in 
Part  II  as  well  as  those  in  Part  III.  He  outlines  programs  in  two  diverse  areas-art 
instruction  and  treatment  of  aphasia— that  derive  from  a theory  of  compre- 
hension that  draws  upon  elements  of  the  models  presented  in  many  of  the 
previous  chapters. 

The  three  chapters  in  the  fourth  and  final  section  represent  responses  to  many 
of  the  issues  raised  in  previous  chapters.  Glaser  (Chapter  17)  addresses  the  issue 
of  how  we  can  take  the  results  of  scientific  research  and  apply  them  to  practical 
problems.  He  argues  for  the  development  of  a linking  science-a  science  of 
instructional  design- that  would  transform  our  knowledge  of  cognitive  processes 
into  instructional  procedures  while  at  the  same  time  providing  tests  and  chal- 
lenges for  the  existing  theories.  Cazden  (Chapter  !8)  raises  some  very  practical 
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questions  based  upon  her  varied  experience  as  both  a classroom  teacher  and  a 
research  psychologist.  One  example  of  the  kind  of  issue  that  is  central  to  a 
theory  of  instruction  but  still  inadequately  handled  by  our  current  theories  is 
Cazden’s  question.  “What  is  the  value  of  practice?”  Finally,  Klahr  (Chapter  19) 
sketches  some  of  the  issues  that  would  need  to  be  resolved  before  one  could 
construct  a model  of  a learner 
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Part  I 

STRATEGIES  FOR 
INSTRUCTIONAL  RESEARCH 


It  is  often  thought  and  said  that  what  we 
most  need  in  education  is  wisdom  and  broad 
understanding  of  the  issues  that  confront  us. 
Not  at  all,  I say.  What  we  need  are  deeply 
structured  theories  in  education  that  drasti- 
cally reduce,  if  not  eliminate,  the  need  for 
wisdom.  I do  not  want  wise  men  to  design 
or  build  the  airplane  I fly  in,  but  rather 
technical  men  who  understand  the  theory  of 
aerodynamics  and  the  structural  properties 
of  metal . . . And  so  it  is  with  educa- 
tion ...  I want  to  see  a new  generation  of 
trained  theorists  and  an  equally  competent 
band  of  experimentalists  to  surround  them, 
and  I look  for  a day  when  they  will  show 
that  the  theories  I now  cherish  were  merely 
humble  way  stations  on  the  road  to  the 
theoretical  palaces  they  have  constructed 
(Suppes.  1974). 
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Promoting  Language  Skills: 
The  Role  of  Instruction 

John  B.  Carroll 
Educational  Testing  Service 1 


Can  language  skills  be  taught?  The  answer  to  this  question  depends  upon  how  we 
define  “language  skills”  and  what  we  mean  by  “teaching.”  There  appears  to  be  a 
fundamental  divergence-usually  between  behavioral  scientists  on  the  one  hand 
and  educators  on  the  other-as  to  what  these  terms  mean. 

In  the  context  of  behavioral  science,  instruction  is  often  taken  to  mean 
definite,  specifiable  “behavioral  objectives,  high! /-controlled  instructional  set- 
tings and  materials,  and  definite  procedures  for  observing  and  measuring  learning 
outcomes.  But  in  the  minds  of  educators,  it  is  generally  the  case  that: 

Instruction’  is  a word  within  the  system  (education,  that  has  no  operational  defini- 
tion. It  refers  to  many  different  ways  in  which  the  relationships  among  students, 
teachers,  learning  materials  may  be  structured.  Discursive  situations,  at  all  levels  of 
instruction,  tend  to  be  seen  as  effective.  They,  and  other  types  of  structured  situations, 
are  being  defended  against  displacement  by  instruction  geared  only  to  operationalized 
episodes  [Dickinson,  1971,  p.  112]. 

Even  McKeachie  (1974),  a behavioral  scientist,  is  inclined  to  express  his  unhappi- 
ness with  the  term  “instructional  psychology,”  “for  ‘instruction  carries  n 
connotation  of  teacher  direction  or  building  that  is  less  pleasing . . . than  the 
emphasis  on  the  student  implied  in  ‘learning’  [p.  162] .” 

Dispute  over  the  meaning  of  “instruction”  and  “teaching”  is  found  also  among 
educational  philosophers.  It  is  commonly  agreed  that  teaching  is  any  activity 
that  is  designed  to  result  in  learning  on  the  part  of  the  individual  being  taught, 
but  there  is  debate  as  to  whether  such  an  activity  should  be  called  teaching  when 
there  is  no  intent  on  the  part  of  the  teacher  to  teach,  or  when  it  is  not  successful 
in  producing  its  intended  outcome  (Scheffler,  1960). 

1 Currently,  Department  of  Psychology,  University  of  North  Carolina  at  Chapel  Hill. 
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Consider  the  claim  that  the  child  leams  his  native  b ,uage  without  being 
“taught,”  simply  by  “exposure”  to  adult  models.  To  support  such  a claim,  one 
would  have  to  have  in  mind  how  he  distinguishes  between  teaching  and  non- 
teaching, and  how  he  means  to  define  “exposure.”  On  the  other  hand,  it  is 
conunonly  accepted  that  one  can  “teach”  vocabulary  knowledge,  or  a foreign 
language. 

If  we  are  to  study  rationally  the  problems  of  teaching  language  skills,  we  must 
embrace  such  concepts  as  “creativity  in  language”  within  a scientific,  deter- 
ministic framework.  If  there  is  'uch  a thing  as  a natively  predetermined  “lan- 
guage acquisition  device”  (McNeill,  1970)  that  accounts  for  the  acquisition  of 
language  skills,  we  must  describe  it  scientifically.  If  the  system  of  language  is 
“intema,ized”  by  language  learners,  the  resultant  internalized  states  must  be 
open  to  scientific  study  by  appropriate  observation  of  the  “behavior”  (broadly 
defined)  that  occurs  under  specifiable  conditions.  Some  of  these  “specifiable 
conditions”  will  fall  under  the  concept  of  “instruction,”  but  I assume  that  they 
will  cover  not  only  the  kinds  of  deliberate,  formal  operations  that  a teacher 
performs  in  the  classroom,  but  also  the  informal,  largely  nondeliberate  actions  of 
people  interacting  wfh  each  other  through  language  and  other  means,  for 
example,  the  interactions  of  a mother  and  her  child,  or  the  interactions  of  one 
student  with  another  in  a “discursive”  situation.  Whether  these  actions  are  taken 
with  an  “intent”  to  teach  or  produce  learning,  and  whether  these  actions  are 
“successful”  in  producing  learning,  are  questions  that  are  not  of  central  interest. 
It  does  not  much  matter  whether  or  not  we  say  that  the  child  leams  his  language 
“without  being  taught.”  What  matters  is  what  external  influences,  that  we  might 
be  able  to  have  under  our  option  or  control,  there  are  upon  the  child’s  learning. 
There  are  many  kinds  of  “language  skills”:  speaking,  listening,  reading,  hand- 
writing, spelling,  and  written  composition  are  the  native-language  skills  that  are 
given  most  attention  in  the  schools,  but  we  might  also  want  to  discuss  what  are 
often  called  “communication  skills,”  including  nonverbal  communication  skills. 
In  all  these  skills,  there  is  a developmental  dimension  as  the  individual  moves 
from  infancy  to  adulthood.  In  a previous  publication  (Carroll,  1971b)  I have 
reviewed  the  literature  on  the  development  of  these  native-language  skills  be- 
yond the  early  years.  In  addition,  we  may  want  to  consider  the  problems  of 
teaching  a second  or  a foreign  language,  or  of  teaching  a “standard  form  of  a 
language  when  the  learner's  native  tongue  is  a “nonstandard”  form  of  that 
language.  I have  reviewed  research  on  many  aspects  of  hese  matters  in  a number 
of  publications  (Carroll,  1963,  1966,  1968a,  1971a),  and  I do  not  intend  to 
recapitulate  these  reviews  here.  Instead,  I propose  to  focus  attention  on  the 
models  of  the  language  learner  that  seem  to  be  implicitly  assumed  by  teachers, 
writers  of  instructional  materials,  and  others  in  education,  as  well  as  s ich  models 
as  are  offered  by  psychologists,  psycholinguists,  and  linguists.  We  must  see  m 
what  respects  these  models  are  inadequate  or  conflict  with  one  another.  We  ast 
also  attend  to  what  role  these  models  assume  for  “instruction’-defined  broadly 
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as  any  external  influences  on  the  development  of  language  skills,  as  represented 
both  by  formal  teaching  actions  and  by  more  informal  social  interactions. 


NOTIONS  OF  SKILL,  COMPETENCE,  AND  PERFORMANCE 

If  we  are  to  begin  promoting  language  skills,  we  need  a notion  of  what  these 
skills  consist  of.  Indeed,  it  would  be  to  our  advantage  if  we  had  available  a 
complete  theory  of  how  people  acquire  and  use  language  skills-both  productive 
and  receptive  skills,  and  both  skills  with  the  spoken  language  and  skills  with  the 
written  forms  of  language. 

One  prerequisite  for  the  development  of  such  a theory  is  consideration  of  the 
relation  between  a language  system,  as  described  by  linguists,  and  the  activities 
and  behaviors  that  involve  its  use.  In  recent  years,  this  problem  has  been 
discussed  in  terms  of  the  distinction,  most  trenchantly  formulated  by  Chomsky 
(1965),  between  “competence”  and  “performance.”  The  distinction  has  been 
debated  almost  ad  nauseam  (Bever,  1970;  Fillenbaum,  1971;  Fodor  & Garrett, 
1966;  Hayes,  1970)  and  it  would  be  a distraction  to  fully  discuss  die  matter 
here,  but  since  I have  a particular  viewpoint,  I need  to  state  my  position  with 
some  semblance  of  justification.  I believe  that  all  Chomsky  literally  meant  to 
refer  to  was,  on  the  one  hand,  what  is  learned  (competence),  and  on  the  other, 
the  behavior  that  manifests  that  learning  (performance),  including  both  receptive 
and  productive  language  behaviors.  The  notion  of  competence  is  entirely  neutral 
as  to  what  type  of  grammatical  model  should  describe  competence.  Chomsky 
offered  generative  grammars  as  theories  of  competence,  but  linguists  (and 
others)  are  free  to  select  other  kinds  of  grammar  to  describe  competence. 
Further,  the  notion  of  performance  is  neutral  as  to  what  theory  or  model  of 
performance  mechanisms  one  might  adopt;  a model  of  performance  mechanisms 
might  be  derived  from  behavioristic  principles,  from  cognitive  psychology,  or 
from  any  other  psychological  system.  Much  of  the  discussion  about  competence 
and  performance,  however,  has  been  concerned  with  the  extent  to  which  a 
model  of  performance  must  “incorporate”  a competence  model,  and  if  so, 
whether  the  competence  model  (i.e„  the  type  of  grammar  chosen)  determines 
the  form  of  the  performance  model.  In  my  opinion  the  determination  is  in  the 
opposite  direction:  the  form  chosen  for  the  performance  model  will  tend  to 
dictate  the  form  of  the  competence  model  and  therefore  the  form  of  the 
grammar.  This  is  the  case  because  the  mechanisms  or  processes  that  a perfor- 
mance model  assumes  are  not  indifferent  to  their  content,  i.e.,  to  the  elements 
upon  which  they  operate. 

There  are  perhaps  many  possible  ways  to  write  grammars  for  verbal  output, 
but  the  type  of  grammar  that  makes  psychological  (sense  is  determined  by  the 
kinds  of  mechanisms  that  are  assumed  in  the  performance  model.  There  are 
various  alleged  demonstrations  (e.g.,  Bever,  Fodor,  & Garrett,  1968)  th*t  a 
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transformational  grammar  cannot  be  handled  by  an  associations  tic,  “stimulus- 
response”  theory.  This  is  usually  interpreted  to  mean  that  a behavioristic 
account  of  language  behavior  is  unacceptable-so  great  is  the  faith  in  transforma- 
tional grammar.  Suppose,  however,  that  no  mechanism  can  be  found  to  handle 
such  a grammar.  (In  fact,  I am  not  aware  that  anyone  has  formulated  such  a 
mechanism.)  This  would  present  a problem  of  the  psychological  acceptability  of 
transformational  grammar  for  any  performance  model. 

My  conclusion  is  that  the  designer  of  a performance  model  ccn  afford  to  be 
indifferent  concerning  what  type  of  grammatical  theory  the  linguist  may  want  to 
choose  to  satisfy  his  or  her  own  criteria.  As  a psychologist,  my  cri*eria  lie  within 
the  realm  of  psychology.  Thus,  the  land  of  grammar  I choose  must  satisfy  the 
basic  psychological  criterion  that  it  must  be  plausible  from  the  standpoint  of 
being  capable  of  being  handled  by  known  or  discoverable  psychological  pro- 
cesses. In  effect,  this  means  that  a grammar  for  a given  language  system  must  be 
included  within  the  performance  model  for  that  language,  the  distinction  be- 
tween competence  and  performance  remains  as  before-competence  refers  to 
what  is  learned,  performance  refers  to  a behavior  manifesting  that  learning. 

This  poL  it  of  view  is  actually  not  as  heretical  as  it  may  seem.  Labov  (1971,  p. 
452)  says.  “There  seems  to  be  general  agreement  that  a valid  theory  of  language 
must  eventually  be  based  on  rules  that  speakers  actually  use.”  Bever  (1970,  p. 
345)  tells  his  readers  that  if  they  will  “accept  the  possibility  that  ongoing  speech 
behavior  does  not  utilize  a linguistic  grammar,”  they  will  not  be  surprised  “that 
the  mechanisms  inherent  to  ongoing  speech  behavior  do  not  manifest  transfor- 
mations or  any  operations  directly  based  on  them.” 

A point  of  view  that  seems  much  closer  to  mine  is  that  of  Schlesinger(1971), 
who  writes: 

There  i»'.  no  place  for  intentions  in  a grammar,  but  any  theory  of  performance  which 
faili  to  tyke  intentions  into  account  must  be  considered  inadequate.  The  model  of  a 
human  speaker  must,  of  course,  contain  rules  that  determine  the  grammatical  structure 
of  the  output  These  rules,  however,  must  be  assumed  to  operate  on  an  input  which 
represents  the  speaker’s  intentions  (p.  64] . 

I would  identify  these  rules  as  a grammar  incorporated  into  a performance 
grammar-in  Schlesinger’s  terms,  they  would  be  "realization  rules”  for  convert- 
ing “I  markers”  (input  or  intention  markers)  into  utterances.  There  is  a certain 
similarity  here  to  Bever’s  (1970,  p.  286)  notion  that  “talking  involves  actively 
mapping  internal  structures  onto  external  sequences,  and  understanding  others 
involves  mapping  external  sequences  onto  internal  structures”-that  is,  if  we 
identify  internal  structures  with  Schlesinger’s  I markers.  Much  of  Bever’s  paper  is 
concerned  with  trying  to  identify  “heuristics”  or  “cognitive  strategies”  whereby 
the  hearer  finds  out  how  “external  sequences”  (i.e.,  strings  of  speech)  are  to  be 
mapped  into  internal  structures.  While  it  is  debatable  whether  he  has  identified 
the  heuristics  that  language  users  actually  employ,  the  enterprise  seens  to  be  in 
the  right  direction. 
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To  emphasize  the  claim  that  the  grammar  must  be  incorporated  in,  and 
determined  by,  the  performance  model,  I have  called  the  that  1 have 

developed  for  a small  subset  of  English  sentences  a “performance  grammar” 
(Carroll,  1974a).  This  performance  grammar  thus  far  centers  attention  on 
language  production;  it  is  my  belief  that  the  problem  of  production  must  be 
dealt  with  before  problems  of  reception  and  comprehension  can  properly  be 
investigated.  This  is  because  the  hearer’s  problem  is  to  determine  the  I marker  of 
the  speaker;  it  seems  reasonable  to  suppose  that  to  the  extent  that  speaker  and 
hearer  share  the  same  language  system,  die  hearer  would  rely  to  a large  extent  on 
the  same  “realization  rules”  for  converting  I markers  into  speech  that  the 
speaker  does.  The  performance  grammar  is  conceived  of  as  having  two  compo- 
nents: the  intentive  component,  and  the  code  component.  The  intentive  com- 
ponent specifies  the  elements,  variables,  and  structures  found  in  I markers,  and 
the  code  component  contains  the  “realization”  rules  for  converting  the  contents 
of  I markers  into  grammatically  acceptable  speech.  The  rules  in  the  code 
component  can  be  stated  as  “production  systems"  in  Newell’s  (1973)  sense,  i.e., 
they  can  be  stated  in  the  form  of  one  or  more  condition-action  pairs.  This  type 
of  grammar,  incidentally,  is  exemplified  also  by  Halliday’s  systemic  grammar 
(Hudson,  1971;  Muir,  1972),  which  Morton  (1968)  calls  a “Category  B”  gram- 
mar that  describes  how  language  behavior  can  be  produced  outside  the  rules  of 
grammars  of  a more  linguistic  character,  like  my  performance  grammar,  Halli- 
day’s systemic  grammar  emphasizes  the  choices  open  to  the  speaker  as  he  speaks, 
but  I would  feel  that  the  “intentive”  component  of  Halliday’s  grammar  is  as  yet 
only  a latent  structure;  i.e.,  the  conditions  for  the  choices  are  not  made  explicit, 
whereas  they  would  have  to  be  in  a complete  performance  grammar. 

Discussions  of  “ ompetence”  in  linguistics  have  laid  little  emphasis  on  whether 
the  competence  may  vary  from  one  speaker  to  another,  or  whether  competence 
can  be  quantified.  Muscat-Tabakowska  (1969)  has  presented  an  interesting 
discussion  of  these  as  issues  they  apply  in  foreign-language  teaching;  her  remarks 
are  applicable  also  to  the  competence  of  native  speakers.  She  narrows  the 
definition  of  competence  to  mean  “tne  actual  knowledge  of  the  underlying 
system  of  rules  at  a given  time,”  from  which  she  concludes  that  (1)  “compe- 
tence . . . can  be  learned,  and  probably  can  also  be  taught;  (2)  competence  is 
relative,  for  it  can  be  bigger  or  smaller,  both  in  different  speakers  at  the  same 
time  and  in  the  same  speakers  at  different  times;  and  (3)  competence  is 
measurable,  in  that  it  is  possible  to  infer  the  amount  of  competence  from  the  ob- 
servable data  (from  performance) . . . [Muscat-Tabakowska,  1969,  pp.  42-43] . 

Elsewhere  (Carroll,  i968b)  1 have  set  forth  a series  of  propositions  about 
competence  and  performance  in  their  application  to  problems  of  testing  compe- 
tence in  a foreign  language,  but  they  are  equally  applicable  to  similar  problems 
in  a speaker’s  native  language.  Among  these  are: 

Competence  in  a language  consists  of  a series  of  interrelated  habits  (acquired  stimulus- 
response  mechanisms)  which  can  be  described  in  terms  of  stated  rules  [p.  47] . 
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nx;  actual  manifestation  of  linguistic  competence ...  in  behavior  may  be  called 
Unguis  be  performance,  and  is  affected  by  a large  number  of  nonlinguistic  variables  [p. 

I further  asserted  that: 


There  are  individual  differences,  both  in  competence  and  performance  variables,  that 

may  be  a function  of  either  constitutional  or  experiential  variables  [p.  51  ] . 

I pointed  out  that  individual  differences  in  competence  might  be  found  in 
different  domains,  such  as  phonology,  morphology,  lexicon,  and  grammar,  and 
that  individual  differences  in  performance  could  be  observed  in  such  matters  as 
speed  of  response,  diversity  of  response,  complexity  of  information  processing, 
and  awareness  of  competence. 

Such  an  analysis  of  linguistic  competences  and  performances  suggests  that  it  is 
quite  possible  that  a diversity  of  detailed  psychological  models  nay  need  to  be 
incorporated  in  a complete  performance  model.  For  example,  the  psychological 
model  used  to  study  the  acquisition  of  a lexical  item  as  a linguistic  form  may  be 
different  from  the  one  used  to  study  the  meaning  of  that  linguistic  form,  and  a 
still  different  model  may  be  required  to  account  for  the  acquisition  of  the 
grammatical  category  and  distributional  characteristics  of  the  form.  Further, 
models  foi  the  acquisition  of  phonological  items,  or  of  grammatical  rules,  may  be 
radically  different  from  any  of  the  models  required  in  connection  with  lexical 
forms.  We  may  already  be  able  to  apply  certain  standard  paradigms  (Melton, 
1964)  to  several  of  these  cases:  for  example,  acquisition  of  lexical  meanings  may 
be  a case  of  associationistic  learning,  or  a case  of  concept  learning;  and  acquisi- 
tion of  phonological  competence  may  have  elements  of  perceptual  learning  ai.d 
of  psychomotor  learning.  Where  our  standard  paradigms  seem  to  fail  most  is  in 
explaining  the  acquisition  of  grammatical  rules.  It  is  still  unclear  what  the  source 
of  the  difficulty  may  be:  is  it  that  appropriate  psychological  models  have  not 
even  been  discovered,  let  alone  refined,  or  is  it  that  we  have  not  discovered  the 
way  in  which  grammatical  rules  should  be  formulated  so  as  to  lend  themselves  to 
the  application  of  psychological  models?  I suspect  there  are  difficulties  on  both 
of  these  counts. 


NAIVE,  BEHAVIORISTIC,  AND  COGNITIVE  THEORIES 
OF  LANGUAGE  LEARNING 

What  happens  when  people  (or  other  organisms)  acquire  la,igtage  skills?  What 
models  of  the  language  learning  process  seem  to  be  assumed  by  their  teachers,  or 
by  people  who  prepare  instructional  materials? 

That  people  do  learn  language,  even  when  taught  by  teachers  (e.g.,  mothers) 
uninformed  about  any  systematic  scientific  principles  of  learning,  is  evident. 
Whether  people  leam  any  better  when  they  are  taught  according  to  some 
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system* -'c  theory  of  instruction  is  not  so  ivident.  Even  if  the  teacher,  or  the 
preparer  of  in  .tru  -tional  materials,  can  V -rumed  to  have  been  influenced  by 
some  doctrine  about  learning  and  teaching,  i is  hard  to  tell,  from  an  instance  of  his 
or  her  teaching,  whether  he  or  she  is  actuary  being  guided  by  that  doctrine,  unless 
he  or  she  explicitly  tells  us  so. 

I will  discuss  three  “theories”  of  language  learning  and  teaching.  I assume, 
first,  that  a “naive”  or  “common  sense”  theory  of  learning  exists,  that  in  fact  it 
has  existed  for  cei  tunes,  and  that  this  “naive”  theory  underlies  the  instructional 
procedures  used  by  most  people  engaged  in  promoting  language  skilis-whether 
they  be  mothers  teaching  their  children  to  speak  or  formally  certified  teachers  of 
English  or  foreign  languages.  I do  not  employ  the  term  “naive”  in  a pejorative 
sense,  but  rather  to  refer  to  the  kind  of  “common-sense  psychology,”  described 
by  Heider  (1958,  pp.  5-7),  as  summarizing  the  common  wisdom  that  people 
have  about  their  behavior  and  motives.  Of  course,  a naive  theory  of  learning  may 
in  many  respects  be  inaccurate,  wrong,  or  wrongly  applied.  Nevertheless,  it 
cannot  be  all  wrong,  since  it  has  been  part  of  the  underlying  foundation  of 
teaching  and  learning  over  the  centuries,  that  is,  the  kind  of  teaching  that  has 
been  at  least  partly  successful. 

Second,  I will  describe  how  “behavior  theory”  has  singled  out  for 
analysis  and  reinterpretation  certain  features  of  the  naive  theory.  By  “behavior 
theory,”  I mean  one  comparable  to  Skinner’s  (1953, 1957)  with  its  emphasis  on 
operant  conditioning. 

Third,  I will  indicate  some  limitations  of  Skinnerian  behavior  theory  and  point 
out  how  cognitive  theory  provides  a refinement  of  naive  theory  (and  a reinter- 
pretation of  behavior  theory).  The  discussion  will  thm  lead  to  the  implications 
of  cognitive  theory  for  instructional  procedures  in  promoting  language  skills. 

Assumptions  of  Naive  Learning  Theory 

If  we  examine  typical  instructional  materials,  ob  nstructional  episodes,  or 
talk  with  teachers,  we  can  infer  that  naive  ning  ory  is  based  on  eight 
principal  implicit  assumptions: 

1.  Learning  occurs  best  when  it  is  “motivate  Ideally,  maximum  learning 
occurs  when  the  individual  “wants  to  learn.”  Helen  Keller  (1936,  pp.  23-24) 
recalls  that  after  arriving  at  an  understanding  that  “everything  has  a name” 
through  being  shown  how  the  word  water  is  finger  spelled,  she  “left  the  well 
house  eager  to  learn.”  Most  textbooks  are  written  on  the  assumption  that  they 
will  be  used  by  “motivated”  students;  some  of  them  attempt,  however,  to 
stimulate  motivation  and  interest.  Naive  theory  further  assumes  that  if  an 
individual  does  not  want  to  learn,  he  can  nevertheless  be  made  to  learn  by 
drawing  his  attention  to  the  consequences  of  not  learning.  The  critical  role  of 
motivation  is  assumed  to  be  in  every  case  to  direct  and  focus  the  individual’s 
attention  on  what  is  to  be  learned. 
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2.  Thus,  a critical  variable  in  learning  is  attention,  i.e.,  a state  of  th(  learner 
whereby  he  becomes  consciously  aware  of  the  material  to  be  lear  ied  and 
examines  it  according  to  whatever  me*ns  are  necessary.  When  the  . ratter  is 
complex,  this  may  require  diligent  study,  but  even  in  simple  cases,  some  degree 
of  “attention”  is  required.  We  have  a report  (McNeill,  1970,  p.  1 06}  - >f  a mother 
trying  to  “correct”  her  child’s  tendency  of  saying  “Nobody  don  t like  me.” 
After  a series  of  unsuccessful  attempts,  she  says:  “Now  listen  carefully:  say 
‘nobody  likes  me.’  ” In  the  child’s  response,  “Oh!  Nobody  don’t  likes  me,”  it  is 
evident  that  the  child  does  in  fact  pay  greater  attention  to  the  stimulus  than 
before,  even  though  the  response  is  not  quite  what  the  mother  hoped  for.  A 
series  of  steps  recommended  by  Fitzgerald  (1951)  for  learning  to  spell  a word 
incluoc  admonitions  to  “look  at  the  word  . . . pronounce  it ...  see  the 
word  . . . say  it . . . make  every  letter  carefully.”  In  reporting  how  she  learned 
the  finger-spelling  for  water,  Keller  (1936)  notes  that  “her  whole  attention  was 
fixed  on  the  motion  of  her  teacher’s  fingers. 

A corollary  of  this  proposition  is  the  principle  “one  thing  at  a tii.re.”  It  is 
assumed  that  to  maximize  attention,  attention  should  be  directed  at  only  one 
thing  at  a time.  Divided  attention  and  distractions  retard  learning.  Thus,  if  the 
thing-to-be-leamed  is  complex,  its  parts  must  be  attended  to  separately.  Fitz- 
gerald (1951),  in  the  prescriptions  mentioned  earlier,  advises  the  student  to  “say 
the  letters  in  order"  and  “make  every  letter  carefully.”  Instructional  materials 
generally  attempt  to  focus  the  student’s  attention  on  particular  aspects  of  what 
is  to  be  learned. 

Another  corollary  is  that  the  learner  controls  the  learning  in  that  he  can 
control  his  attention,  and  is  generally  aware  of  how  much  and  how  well  he 
knows  he  has  learned  at  a given  point  (the  student,  of  course,  be  mistaken  about 
his  state  of  knowledge.). 

3.  The  result  of  learning  is  some  change  in  internal  state.  This  can  be  either  a 
change  in  state  of  knowledge  about  facts,  rules,  opinions,  etc.,  (“knowledge 
that...  ”),  or  a change  in  state  of  knowledge  about  procedures  and  behaviors 
( knowledge  ! ow  to . . . ”).  Knowledge  can  come  from  a number  of 
sources-experience,  observations  of  place,  events,  and  others' behavior,  lectures 
and  explanations,  or  even  from  mental  discovery-“using  one’s  head.”  Informa- 
tion may  be  stored  as  memories,  although  some  memories  can  be  forgotten. 
Memories  about  behaviors  are  stored  as  “habits.”  Knowledge  can  even  include 
information  about  how  to  learn:  Fitzgerald’s  prescriptions  about  learning  to 
spell  are  of  this  nature.  The  e assumptions  about  what  is  to  be  learned  are 
illustrated  in  a “bulletin”  suggested  for  use  in  a “better  speech  campaign”  for 
speakers  of  nonstandard  dialects  at  the  secondary  school  level  (Golden,  1960,  p. 
94).  Golden  assumes  that  in  order  for  the  nonstandard  speaker  to  learn  to  avoid 
usages  like  shouldn ’t  ought,  disremember,  and  irregardless,  he  or  she  first  needs 
to  be  told  that  they  are  “wastebasket”  words  even  though  there  is  “nothing 
really  wrong”  with  their  use  in  some  situations.  In  his  text  on  teaching  English  as 
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a second  language,  Dixson  (1971,  p.  2)  gives  rules  for  forming  the  negative  of  to 
be:  “We  form  the  negative  of  to  be  by  placing  not  after  the  verb.”  A widely  used 
textbook  of  Spanish  (Bolinger  et  al,  1960,  p.  57)  teaches  the  student  the 
distinction  between  Spanish  ser  and  estar  by  a lengthy  discussion  of  the  varying 
uses  and  meanings  of  these  forms;  that  is,  the  student  is  assumed  to  need  a store 
of  information  or  knowledge  about  them  in  order  to  learn  to  use  them  according 
to  Spanish  norms. 

4.  Practice  and  repetition  contribute  to  the  establishment  and  strengthening 
of  memories.  Memories  become  clearer  and  firmer  by  repeated  exposure  to  the 
subject  to  be  learned.  This  is  believed  to  be  true  both  for  memories  of  experi- 
ences and  for  memories  of  behaviors  (i.e , habits).  Retrieval  of  memories  for 
experiences  eventually  becomes  extremely  facile  after  repeated  exposure  to  the 
stimuli,  and  behavior  repeatedly  performed  becomes  extremely  “automatic” 
when  the  conditions  for  that  behavior  are  appropriate.  Fitzgerald’s  (1951) 
prescriptions  for  the  learning  of  spelling  advise  the  student  to  spell  a word 
several  times,  each  time  checking  its  correctness.  Golden  (1960)  advises  stu- 
dents: 

“This  shifting  and  perfecting  of  language  pattern  is  not  done  easily  or  overnight. ...  As 
it  takes  continuous  practice  and  many  other  factors  to  shift  from  being  merely  a 
chop-sticks  player  to  being  a good  musician,  so  it  takes  practice  and  thought  and  desire 
and  then  more  practice  to  shift  into  using  the  pattern  that  is  universal,  and  to  feel  so 
much  at  home  in  the  new  pattern  that  we  can  truly  ‘make  music’  with  it  [p.  94 ] . 

Bloomfield’s  (1942)  final  admonition  to  the  foreign  language  learner  is  “PRAC- 
TICE EVERYTHING  UNTIL  IT  BECOMES  SECOND  NATURE  [p.  16,  capitali- 
zation in  the  original] .” 

5.  Th,ere  are  degrees  of  learning,  and  until  perfect  mastery  is  attained,  re- 
sponses must  bf  checked  for  their  ''correctness  ” “Feedback”  has  the  primary 
function  of  giving  the  learner  information  which  he  can  use  to  compare  his  or 
her  response  with  what  the  response  should  be;  whether  it  “rewards”  or  punishes 
the  learner  is  of  secondary  concern.  We  have  already  cited  Fitzgerald’s  advice  to 
the  learner  to  check  the  correctness  of  his  efforts  to  spell  a word  each  time  he 
tries  it. 

6.  Rewards  are  administered  by  external  agencies  for  the  act  of  learning  (and 
punishments  for  failures  in  learning);  one  does  not  reward  or  punish  the  actual 
behavior  performed , but  the  learner  himself.  Rewards  and  punishments  are  seen 
as  constituting  information  to  the  learner  regarding  the  consequences  of  learning 
or  not  learning;  this  is  true  whether  the  rewards  and  punishments  are  adminis- 
tered verbally  or  physically.  Rewards  (school  grades,  “A,”  “B,”  etc.)  are  also 
given  to  convey  information  to  the  learner  concerning  his  overall  progress  in 
learning. 

7.  Learning  builds  on  prior  knowledge  and  habits.  Teachen  and  textbook 
authors  generally  mean  to  take  account  of  knowledge  and  habits  already  ac- 
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quiied  at  a given  point.  If  we  look  at  random  at  almost  any  page  of  a textbook, 
we  can  usually  infer  what  the  textbook  writer  assumes  the  studer  t knows  or  has 
learned  up  to  that  point. 

8.  Learning  is  an  active  process;  “learning  by  doing”  is  a wstchword  among 
many  educational  writers.  Textbook  authc  s are  aware  of  the  need  to  have 
students  be  able  to  make  active,  uncued  responses.  For  example,  in  the  Spanish 
textbook  cited  earlier  (Bolinger  et  aL,  1960,  p.  28),  it  is  pointed  out  that  the 
students"  books  must  be  closed  during  the  performance  of  a drill  on  petson- 
number  substitutions. 

The  above  assumptions  apply  not  oniy  to  language  learning  but  in  fact  to  most 
types  of  school  learning,  and  to  most  other  types  of  learning  as  well.  The  special 
difficulties  in  applying  these  assumptions  tt  certain  aspects  of  language  learning 
(e.g.,  the  child’s  lemming  of  his  native  language,  particularly  its  grammar)  are 
only  dimly  perceived  in  naive  theory,  but  a special  theory,  that  of  “imitation,”  is 
applied  to  explain  language  learning.  Naive  learning  theory  attaches  importance 
to  imitation  as  a learning  process  because  behavior  that  is  apparently  imitative  is 
frequently  observed.  Mothers  try  to  get  their  children  to  imitate  their  language 
and  are  sometimes  successful:  Kobashigawa  (1968)  reports  an  episode  in  which  a 
mother  elicits  a form  by  using  a question  intonation;  the  child  tends  to  imitate 
not  only  the  form  but  the  intonation,  and  imitates  a different  intonation  when 
the  mother  changes  hers: 

Mother:  That’s  a radio.  ...  A radio?  (with  question  intonation) 

Child:  [ we-oj  (with  question  intonation) 

Mother:  Radio,  (with  falling  intonation) 

Child:  [we-oj  (with  falling  intonation) 

McNeill  (1970,  p.  106)  reports  unpublished  material  from  Roger  Brown’s  re- 
search illustrating  children’s  (usually  shortened)  imitations  of  adult  sentences. 

Behavioral  Learning  Theory 

The  behavior  theory  developed  by  Skinner  (e.g.,  1953)  and  his  followers  focuses 
on  the  properties  of  what  are  called  operant  responses  and  the  conditions  that 
are  presumed  to  control  their  elicitation,  learning,  and  extinction.  The  paradigm 
of  classical  conditioning  is  played  down  in  this  theory  because  it  is  thought  to 
pertain  mainly  to  responses  of  the  autonomic  nervous  system,  responses  that  are 
considered  not  to  be  of  primary  interest  in  educational  settings.  Discussions  of 
classical  conditioning  rarely  figure  in  writings  about  the  application  of  behavior 
theory  in  instruction  (Skinner,  1963). 

The  salient  feature  of  behavior  theory  is  its  treatment  of  the  relations  among 
stimuli,  overt  responses,  and  reinforcements.  In  the  strict  form  of  behavior 
theory,  mental  events  and  covert  responses  are  assumed  to  be  of  no  scientific 
interest,  and  arc  therefore  not  considered.  It  is  assumed  that  changes  in  proba- 
bilities of  emission  of  overt  responses  are  functionally  related  to  the  occurrence, 
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at  specifiable  points  of  time,  of  “reinforcements”— stimuli  tending  * ' satisfy 
drives—,  and,  as  the  case  may  be,  also  to  the  occurrence  of  certain  other  stimuli 
(“discriminative  stimuli”)  that  may  serve  as  cues  for  the  emission  of  the  overt 
responses.  The  overt  responses  thus  come  under  the  “control”  of  reinforcements 
and  discriminative  stimuli  when  the  temporal  relations  and  other  conditions  arc 
as  prescribed  by  the  theory.  The  theory  is  also  much  concerned  with  'he 
“topography”  of  the  responses,  i.e.,  with  their  differentiation,  and  with  the 
manner  in  which  rewards  and  also  discriminative  stimuli  are  differentiated. 

It  is  useful  to  see  how  the  assumptions  of  behavior  theory  compare  with  those 
of  naive  theory: 

1.  Behavior  theory  agrees  that  the  learner  must  be  motivated,  but  it  sees  the 
problem  of  motivation  as  one  of  identifying  drives  for  which  reinforcers  can  be 
specified.  Many  applications  of  behavior  theory  involve  reinforcements  for  basic 
drives  such  as  lunger  and  thirst,  but  according  to  the  “Premack  principle” 
(1959)  any  activity  preferred  by  a learner  can  serve  as  a reinforcer  for  any 
less-preferTed  activity.  Thus,  in  conducting  “programmed  instruction,”  getting 
through  a program  might  be  regarded  as  a reinforcer  for  the  act  of  going  through 
a program,  on  the  assumption  that  the  student  would  rather  not  be  doing  a 
program  than  doing  it. 

2.  Any  consideration  of  “attention”  or  conscious  control  of  learning  is  not 
recognized  in  behavior  theory.  The  principle  of  “one  thing  at  a time,”  however, 
is  utilized  in  behavior  theory  simply  because  of  the  necessity  to  establish  precise 
temporal  relations  between  particular  responses  and  particular  rewards. 

3.  The  only  thing  that  behavior  theory  recognizes  as  being  learned  is  some 
overt  response  (or  some  integrated  combination  of  responses),  which  occurs 
under  appropriate  circumstances  or  stimulus  conditions.  A strict  form  of  behav- 
ior theory  makes  no  assumptions  about  “information,”  “memories,”  “knowl- 
edge,” or  even  “habits,”  although  if  a response  is  "reliably”  established  it  is 
sometimes  loosely  referred  to  as  a habitual  response. 

4.  Matters  having  to  do  with  the  practice  of  responses  and  repeated  exposures 
to  stimuli  are  dealt  with  under  the  rubric  of  “schedules  of  reinforcement,”  i.e., 
with  the  specification  of  the  temporal  relations  and  repetitions  of  stimuli, 
responses,  and  reinforcing  events.  Some  schedules  are  found  more  effective  in 
producing  learning  than  others.  “Forgetting”  of  responses  would  be  interpreted 
as  extinction  of  those  responses  resulting  from  an  ineffective  schedule  of 
reinforcement. 

5.  Feedback  is  considered  to  be  a form  of  reinforcement;  it  applies  to  the 
learner’s  response,  not  to  the  learner. 

6.  Reward  is  obviously  of  central  importance;  like  feedback,  it  applies  to  the 
learner’s  response  and  not  to  the  learner.  Positive  reinforcements  are  believed  to 
be  more  effective  than  negative  ones;  insofar  as  feedback  regarding  incorrect 
responses  is  aversive,  the  conditions  for  learning  should  be  arranged  so  that  the 
learner  makes  a minimum  of  errors. 

x 
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7.  hi  the  theory,  there  is  no  such  thing  u prior  knowledge;  there  are  only 
behaviors  and  responses  that  have  been  learned  previously.  These  previously 
learned  responses  are  to  be  taken  account  of  as  “baseline”  or  “entry”  behavior 

which  may  in  fact  be  prerequisite  for  further  learning  and  for  building  integrated 
response  repertoires.” 

8.  Since  only  overt  responses  are  learned,  learr  Jig  is  obviously  “active.”  The 
presc  iptions  of  naive  theory  about  active  learning  are  interpreted  as  referring  to 
the  necessity  of  fading  irrelevant  cues. 

Despite  a good  deal  of  publicity  and  experimentation,  it  can  hardly  be  said 
that  behavior  theory  has  become  popular  with  all  language  teachers.  However 
the  advent  of  a strict  behavior  theory  was  perhaps  the  precipitating  factor  in 
various  investigators  attempts  to  teach  some  kind  of  language  system  to  lower 
■T*’  ,pecifiCil,y'  chimps,.’  Nevertheless  i,  i,  noTcL  thaT  ZHZ 
tfwoiy  was  responnbie  for  the  successes  of  these  investigators,  such  as  they  have 
been.  The  Gardners  state  that  although  they  recognized  the  theoretical  weak- 
nesses of  the  behavioristic  paradigm,  they  “never  hesitated  to  apply  those 
principles  of  reward  theory  that  were  relevant,”  but  they  cite  a number  of  other 
teaching  techniques  (guidance,  observational  learning)  that  were  generally  more 
effective  than  straightforward  instrumental  conditioning  procedure,  (Gardner  A 
Gardner,  1971).  It  is  obvious  that  Rumbaugh  and  his  associates  and  the  Pre- 
macks  were  strongly  influenced  by  behavior  theory  in  their  work  with  animals, 
using  standard  instrumental  conditioning  techniques  at  least  in  the  earlier  phases 
of  their  work.  Nevertheless,  the  learning  b havior  of  the  animals  had  many 
features  that  could  not  have  been  expected  or  easily  accounted  for  by  behavior 
theoiy.  For  example,  Lana  (the  chimpanzee  taught  by  Rumbaugh  and  his 
associates)  would  every  once  in  a while  make  a mistake  while  she  was  punching  a 
sentence  into  the  computer;  all  by  herself  she  discovered  “correction  proce- 
dure for  canceling  the  input  of  such  a sentence  when  she  “knew”  she  had  made 
a mistake. 

h“Jlnspired  thc  nation  of  instructional  theorist,  who 
developed  ‘programmed  instruction”  (Glaser.  1965);  it  hi,  also  been  a source  of 
guidance  in  the  development  of  “behavior  modification”  techniques  for  chang- 
ing children  s language  behavior  (Hart  A Risley,  1974;  Sapon,  1969).  One  very 
expUat  use  of  behavior  theory  is  that  of  Wear  (IS71)  for  establishing  “receptive 
repertoires  in  children  learning  French. 

J Gardner  and  Gardner  (1971)  tau*ht  . version  of  American  Sign  Lansu.se  thc  sin 
tan*ui*eof  the  dear,  to  a female  chimpanzee  named  Washoe.  The  Premack,  (Premack 

iTJh  UU*hl  * chimP"uee  n»med  Sarah  to  sse  a “laiu^e"  in 

different  colon  and  shapes  were  used  to  commuSX.pl, 

*T(1973  ^ Rumbau«h-  Gil-  Brw".  ™ Glaaersfdd,  Pbani,  Warner  and 

XSifvX  „ ^ M ,0  * Un*Ulge  in  which  "•» 

* com^,«  used  to  control  the  disp  lay,  of 

“UM  h*  produced  dth«  by  “>«  experimenter  or  by  UnTby  the 
punching  of  buttons  in  the  proper  sequence.  ' 


1.  PROMOTING  LANGUAGE  SKILLS  15 


Thus,  behavior  theory  has  been  h'g'tly  successful  in  many  ways.  By  concentrat- 
ing on  directly  observable  events,  it  has  achieved  a kind  of  scientific  respectability 
that  was  not  achieved  by  previous  teaming  theories,  certainly  not  by  any 
kind  of  naive  theory.  More  importantly  for  our  present  purposes,  it  has  served  as 
a filtering  device  for  sorting  out  critical  elements  and  problems  in  learning 
theory.  But  in  this  respect  it  has  revealed  its  weaknesses.  There  is  a lingeiing 
appearance  of  circularity  in  a theory  of  reinforcement  that  seems  to  define 
reinfoxers  in  terms  of  drives  and  drives  in  terms  of  reinforcers,  but  there  ^e 
other  matters  to  worry  about.  The  maj  r gaps  in  the  theory  are  its  inability  to 
deal  with  covert  events  that  are  undoubtedly  relevant  in  learning  and  its  failure 
to  recognize  that  reinforcers  have  their  effect  not  on  responses  as  such  but  on 
the  covert  events  that  antecede  and  trigger  overt  responses.  It  has  no  satisfactory 
theory  of  knowledge  and  information  processing,  nor  of  the  parameters  of 
memory  structures  that  would  presumably  underlie  the  surface  “laws”  of  rein- 
forcement schedules.  From  a practical  viewpoint,  it  has  only  a limited  theory  of 
the  manner  in  which  responses  get  emitted,  so  that  the  practitioner  is  often  hard 
put  to  identify  or  elicit  responses  that  can  serve  as  a basis  for  further  learning. 
The  Gardners  might  have  had  to  wait  for  an  eternity  before  observing  responses 
that  could  serve  as  the  basis  for  communicative  “mands,”  if  they  had  not  in  the 
meantime  discovered  tha»  guidance  or  “molding”  (Foots,  1972)  could  shape 
such  responses.3  Fiom  the  reports  published  thus  far,  there  is  apprrently  no 
means  of  knowing  how  much  “guidance”  the  Premacks  had  to  give  their 
chimpanzee  Sarah  in  order  to  get  her  to  make  the  responses  she  did. 

Two  fundamental  questions  about  behavior  learning  theory  are:  (a)  does 
learning  truly  take  place  on  the  basis  of  solely  the  variables  indicated  by  the 
theory?,  and  (ft)  when  language  responses  are  acquired  or  modified  by  behavior 
modification  techniques,  is  this  learning  of  the  same  character,  resulting  in  the 
same  kind  of  competence,  as  occurs  in  normal  language  learning?  I believe  that 
the  answers  to  both  these  questions  are  in  the  negative.  The  bulk  of  the  evidence 
as  to  what  goes  on  in  the  so-called  “verbal  conditioning”  paradigm  is  that  a 
change  of  behavior  occurs  only  when  subjects  are  consciously  aware  of,  and 
pleasa.  tly  t'ispoted  towards,  the  arranged  contingencies  (e.g.,  Sallows,  Dawes,  A. 
Lichtenstein.  1971).  Weiss  and  Bom  (1967)  doubt  that  “speech  training”  con- 
ducted according  to  behavior  modification  theory  results  in  true  language 
acquisition 

Cognitive  Learning  Theory 

I am  not  aware  that  any  reasonably  adequate  cognitive  theory  of  learning  has  yet 
been  developed.  I would  entertain  the  hypothesis,  however,  that  such  a theory 

’A  mind,  according  to  Skinner's  (1957)  account,  is  a verbal  response  that,  though  initially 
occunir^  with  no  such  function,  has  been  conditioned  to  communicate  some  desire  or 
motive  (“demand"  “command")  on  the  part  of  the  learner. 
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would  provide  a much  improved  basis  for  interpreting  language  learning  phenom- 
ena and  for  suggesting  measures  for  promoting  language  skills.  By  “cognitive 
theory"  1 mean  a theory  that  would  embrace  covert  events  such  as  expectancies, 
plan-,  sets,  images,  memory  storage  and  retrieval,  conscious  control,  and  com- 
plex information  processing.  I assume  that  contemporary  experimental  technol- 
ogy fas  represented  for  example  by  reaction  time  studies,  computer  simulations 
of  behavior,  etc.)  has  means  for  elevating  these  concepts  to  scientific  respecta- 
bility. 

Let  us  see  how  a cognitive  theory  might  deal  with  the  major  p--nts  of  what 
have  called  a naive  learning  theory,  and  incidentally  how  it  would  reinterpret  the 
lands  of  observations  and  procedures  that  result  from  investigations  based  on 
behavior  theory. 


1 The  concept  of  “motivation"  would  be  translated  into  terms  of  various 
lands  of  internal  events.  Some  of  these  would  be  associated  with  basic  dnves, 
that  is,  covert  responses  to  changes  in  physiological  states;  others,  however 
might  be  labeled  as  conscious  goals,  plans  of  action  to  achieve  those  goals,  an 
expectancies  concerning  future  events,  often  in  cognitive  response  to  particular 
situational  requirements.  “Motivation  to  learn"  would  be  interpreted  as  an 
expectancy  of  some  future  state  of  knowledge  or  ability  that  would  result  from 
performance  of  a learning  task,  for  example,  the  ability  to  communicate  in  a 
foreign  language.  Certain  kinds  of  motivational  states  ( intentions ) would  have  a 
peculiar  importance  in  learning  language.  A communicative  act  involves  the 
transmission  of  certain  aspects  of  the  speaker’s  intentions  to  the  cognitive 
information  store  of  the  hearer.  There  is  at  least  inferential  evidence  for  the 
involvement  of  “intentions”  in  the  communicative  acts  of  the  chimpanzees  who 
have  been  studied  by  the  Gardners  and  by  Rumbaugh  and  his  colleagues.  Washoe 
communicated  her  desires  for  more  tickling,  more  banana,  etc.  by  using  the  sipi 
for  more  (Gardner  & Gardner,  1969,  p.  669).  Una  (Rumbaugh  et  al„  1973) 
ommunicated  her  desires  (intentions)  for  juice,  the  presence  of  her  keeper,  or 
even  background  music  by  various  button-pushings.  (Apparently  the  Premacks 
Sarah  was  never  given  the  opportunity  to  communicate  her  desires  ).  Mear  s 
(1971)  first-grade  students  learned  to  apprehend  the  intentions  of  their  teacher 
from  her  French  vocal  responses. 

2 In  contrast  to  behavior  theory,  but  in  agreement  with  naive  theory,  cogni- 
tive theory  would  acknowledge  the  importance  of  attention  in  learning.  Neisser 
(1967  pp  292  ff.)  writes  of  the  usefulness  of  assuming  an  “executive  process 
in  an  information-processing  theory  that  controls  the  flow  of  information  by 
addressing  particular  sensory  registers  or  memory  stores.  Cognitive  theory  wou.d 
assume  that  attention  is  important  at  some  point  in  the  learning  process,  even 
though  its  role  might  diminish  after  processes  become  automatized.  Cognitive 
theory  would  agree  with  naive  theory  in  asserting  that  “motivation"  (as  de- 
scribed above)  enhances  attention.  Expectation  of  reward,  for  example,  night 
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do  this;  problem  difficulty  would  also  do  so.  Rumbaugh  et  al.  (1973)  used 
expectation  of  reward  to  direct  Lana’s  attention  to  the  separate  parts  of  visual 
messages.  In  learning  the  conditional  relationship,  Sarah  is  reported  by  the 
Premacks  (1972)  to  hav,  ,>een  led  to  “pay  closer  attention  t he  sentences,” 
apparently  because  of  di  fficulty  experienced  with  the  problem. 

3.  Cognitive  theory  would  for  the  most  part  agree  with  naive  theory  in 
asserting  that  information  is  what  is  learned,  and  would  object  to  behavior 
theory’s  postulation  tbit  it  is  the  responses  that  are  learned.  According  to 
cognitive  theory,  learning  to  make  particular  responses  is  an  internal  process,  as 
is  also  a decision  to  emit  them  on  a particular  occasion  and  under  particular 
conditions.  Cognitive  theory  would  provide  for  the  automatization  of  response 
emission  by  assuming  that  information  transfer  processes  can  become  extremely 
rapid  and  that  cognitive  sets  are  not  necessarily  always  directly  under  the  control 
of  the  executive.  (In  fact,  an  important  feature  of  cognitive  theory  is  its 
emphasis  on  the  extreme  rapidity  of  most  cognitive  processes.)  Cognitive  theory 
would  further  agree  with  naive  theory  in  assuming  that  information  can  come 
from  a great  variety  of  sc  irees-through  any  sensory  modality,  but  it  would  lay 
stress  on  how  this  information  is  evaluated  and  possibly  transformed  by  the 
central  processor.  It  would  also  be  concerned  with  situational  contexts  in  which 
different  kinds  of  information  are  arriving  simultaneously  and  are  evaluate  ! in 
terms  of  each  other.  learning  the  meanings  of  signs  would  be  a special  case  of 
such  processing,  resulting  (under  suitable  conditions)  in  some  kind  of  awareness 
that  “X  means  Y.”  In  fact,  the  very  concept  of  naming  would  be  a special 
algorithm  used  in  processing  many  types  of  information.  Note  that  Sarah 
(Premack  & Premack,  1972)  was  able  to  learn  a sign  for  this  concept,  in  a sense, 
a second-order,  “metalinguistic”  concept.  (One  may  speculate,  therefore,  that 
the  Gardners  Washoe  a squired  this  concept  and  could  easily  have  learned  a sign 
for  it  if  the  proper  contingencies  had  been  arranged.) 

4.  In  cognitive  theoi^,  the  effects  of  practice  and  repetition  would  be  handled 
through  reference  to  the  parameters  of  various  memory  systems  and  to  the 
cognitive  states  occurring  during  practice  and  repetition.  It  would  be  an  interest- 
ing exercise  to  reexamine  the  extensive  literature  on  the  subject  from  this  point  of 
view.  In  this  way  it  might  be  possible  to  search  for  explanations  of  the  fact  that 
practice  and  repe,  n are  not  universally  effective  in  promoting  learning.  It  could 
be  hoped  that  eogn  ve  theory  would  extensively  refine  the  assertions  of  naive 
learning  theory. 

5.  Feedback  and  correction,  in  cognitive  theory,  would  be  regarded  as  merely 
one  kind  of  information  contributing  to  learning— though  frequently  ai!  impor- 
tant kind  of  information. 

6.  Rewards  and  reinforcers  (including  aversive  stimuli  and  their  withdrawal) 
would  merely  constitute  another  kind  of  information  utilized  in  producing 
learning,  but  rewards  would  be  neither  universally  necessary  nor  sufficient.  Their 
relevance  is  minimal,  for  example,  in  observational  and  incidental  learning. 
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7.  Previously  acquired  knowledge,  stored  in  something  other  than  short-term 
memory,  would  be  regarded  as  important  in  learning  to  the  extent  that  a 
particular  learning  process  required  use  of  that  prior  knowledge. 

8.  Active  learning”  might  be  important,  in  cognitive  theory,  to  insure  that 
knowledge  or  other  kinds  of  learning  are  truly  in  long-term  memory  and  not 
dependent  on  irrelevant  cues  from  short-term  memory. 

In  addition  to  all  these  points,  cognitive  learning  theory  would  lay  stress  on 
the  organism’s  interpretation  and  further  processing  of  the  information  available 
to  it  at  given  points  during  an  instructional  or  problem-solving  episode.  Particu- 
lar sequences  and  arrangements  of  stimuli  in  the  instructional  setting  would 
evoke  different  mental  processes,  some  being  more  conducive  to  learning  than 
others.  The  Premacks  (1972)  seemed  to  be  keenly  aware  of  this  in  their 
speculations  regarding  what  instructional  sequences  might  be  most  productive  of 
learning  in  Sarah.  In  teaching  language  concepts,  it  seemed  to  be  most  useful  to 
present  two  positive  instances  and  two  negative  instances.  One  may  hypothesize 
that  such  an  arrangement  provided  Sarah  with  precisely  the  information  that  was 
both  necessary  and  sufficient  to  define  the  concept.  The  common  features  in  the 
positive  instances  were  perceived  as  defining  the  concept  only  when  they  were 
seen  as  contrasting  with  the  common  features  in  the  negative  instances.  The  way 
in  which  information  to  be  processed  must  be  adequately  presented  is  also 
illustrated  in  the  teaching  of  the  if-then  conditional  sign.  Before  teaching  this 
sign,  it  was  necessary  to  establish  a referent  for  it,  namely  a set  of  situations  in 
which  a contingency  was  present.  The  Premacks’  teaching  of  the  conditional  sign 
to  Sarah  is  almost  precisely  parallel  to  the  manner  in  which  Bereiter  (Bereiter  & 
Engelmann,  1966)  taught  the  meaning  of  if  to  disadvantaged  children.  He  did 
this  by  setting  up  on  the  blackboard  several  situations  demonstrating  a contin- 
gency: If  a a square  is  red,  it  is  little;  if  a square  is  green,  it  is  big;  etc.  From  this 
it  was  easy  to  move  to  teaching  the  meaning  of  the  word. 


CAN  LANGUAGE  SKILLS  BE  PROMOTED?- 
ANSWERS  FROM  COGNITIVE  THEORY 

If  it  has  been  possible  to  develop  a “behavioral  technology”  based  for  the  most 
part  on  reinforcement  theory,  it  may  also  be  possible  to  formulate  a cognitive 
learning  technology,  with  a much  broader  base  in  information-processing  theory 
that  would  be  more  generally  applicable,  more  efficacious,  and,  perhaps,  more 
“humane”  than  behavioral  technology.  Such  a technology  would,  I believe,  be  a 
better  guide  to  the  promotion  of  language  skills. 

It  has  become  fashionable,  in  recent  years,  to  speak  of  language  acquisition 
rather  than  language  lean  ?,  at  least  in  reference  to  child  language.  Apparently, 
this  weasel  word  is  used  to  dodge  the  question  of  whether  1?  :guage  is  actually 
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“learned.”  Indeed,  it  has  been  suggested  that  it  is  “acquired”  through  some  sort 
of  “language  acquisition  device”  (LAD)  that  is  innately  specific  to  the  human 
species  (Lenneberg,  1967;  McNeill,  1970).  The  hypotheses  of  a language  acquisi- 
tion device  and  of  species  specificity  are  becoming  more  suspect  in  view  of  the 
recent  findings  with  Washoe,  Lana,  and  Sarah.  1 forego  discussion  of  whether 
these  animals  attained  systems  with  all  the  essential  properties  of  human  lan- 
Rua«  partly  because  the  data  are  not  all  in  (fo?  discussion  of  this  point,  see 
Bronowtki  & Bellugi,  1970;  McNeill,  1974).  We  have  little  information  as  to  the 
full  range  of  Washoe’s  comprehension  of  American  Sign  Language,  and  at  this 
writing  studies  with  Lana  and  Sarah  (or  their  friends)  are  still  in  progress.  On  the 
basis  of  my  analysis  of  instructional  episodes  with  these  animals,  I suggest  that  it 
should  be  possible  to  teach  chimpanzees  languages  more  closely  related  to  the 
human  language  than  those  thus  far  taught.  For  example,  Washoe  could  have 
been  taught  a sign  for  the  concept  name-of,  for  Sarah  learned  this  concept  quite 
readily.  Sarah,  in  turn,  could  possibly  have  been  taught  a language  system  with  a 
complexity  approaching  that  of  natural  language -rather  than  the  relatively 
“telegraphic”  syntax  that  was  taught. 

If  language  systems  cao  be  taught  to  primates,  it  would  seem  that  they  could 
certainly  be  taught  to  human  children,  but  ordinarily  one  does  not  think  of  any 
need  to  teach  a child  his  native  language,  since  he  seems  to  learn  it  by  himself  or 
herself.  Of  course  there  are  some  children  who  for  one  reason  or  another 
(deafness,  autism,  etc.)  do  not  “acquire”  language  in  the  normal  manner  and 
who  present  serious  learning  problems.  Possibly  a cognitive  learning  technology 
could  contribute  towards  the  solution  of  such  problems,  even  more  than  behav- 
ioral technology  has  already  contributed.  Even  in  the  case  of  “normal  children 
there  are  variations  in  rate  of  language  development;  we  know  very  little  about 
the  causes  of  such  variations.  To  the  extent  that  sue!  variation  might  be 
genetically  determined,  there  is  little  that  the  cognitive  learning  technologist  can 
do  about  them.  To  the  extent  that  they  might  have  environmental  antecedents 
(as  they  very  likely  do),  the  cognitivist  might  suggest  procedures  by  which 
retarded  development  could  be  remedied.  The  essential  need  at  this  time  is  to 
start  applying  cognitive  learning  theory  more  seriously  in  research  on  child 
language  learning  (and  I use  that  word  advisedly),  not  only  to  explore  possible 
applications  but  also  to  refine  the  theory  itself.  Similarly,  cognitive  learning 
theory  could  inspire  research  on  second-language  learning  (Carroll,  1974b). 

Several  lines  of  theoretical  and  empirical  investigation  may  be  suggested.  A 
further  analysis  of  the  experiments  with  animals  would  clarify  cognitive  language 
learning  theory  because  these  experiments  involve  organisms  that  do  not  ordi- 
narily possess  anything  like  human  language:  since  they  cannot  be  said  to  possess 
a language  acquisition  device  like  that  of  human  beings,  the  special  procedures 
that  have  been  used  to  teach  animals  language  must  exemplify  arrangements  that 
cause  learning  rather  than  a fulfillment  of  maturational  possibilities. 
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There  is  also  great  need  and  opportunity  to  reanalyze  and  reinterpret,  from  a 
cognitive  learning  standpoint,  findings  from  studies  of  child  language  acquisition. 
The  research  strategy  should  be  to  see  how  much  the  role  of  a language 

acquisition  device  can  be  delimited  and  how  much  the  role  of  learning  can  be 
amplified. 

Take,  for  example,  the  concept  of  imitation,  the  status  of  which  has  had  an 
interesting  history  in  the  study  of  child  language  learning.  Enshrined  as  an 
important  concept  in  naive  learning  theory,  and  interpreted  in  terms  of  “echoic 
operants  in  behavior  theory,  the  concept  has  generally  been  downgraded  in 
importance  by  specialists  in  child  language  study.  Ervin-Tripp  (1964)  at  one 
point  says,  “ . . . there  is  not  a shred  of  evidence  supporting  a view  that  progress 
toward  adult  norms  of  grammar  arises  merely  from  practice  in  overt  imitation  of 
adult  sentences  [p.  172] .”  Yet  Ervin-Tripp  and  others  (Slobin,  1968)  have  used 
imitation  tasks  extensively  to  study  grammatical  development.  The  problem  is 
partly  semantic:  on  the  one  hand,  “imitation”  can  refer  to  an  alleged  learning 
process;  on  the  other,  it  can  refer  to  art  observed  behavior.  But  the  problem  lies 
also  in  a confusion  about  what  is  imitated.  It  seems  almost  certain  that  children 
imitate,  or  try  to  imitate,  elements  such  as  intonation  patterns  or  single  words. 
They  can  also  imitate  longer  segments,  or  parts  of  these  segments,  but  only 
within  memory  limitations  and  the  competence  they  have  already  achieved  If 
the  concept  or  process  of  imitation  is  to  be  used  in  explaining  or  promoting 
anguage  acquisition,  it  must  be  considered  as  only  one  process  among  possibly 
many  others.  Bloom,  Hood,  and  Lightbown  (1974)  suggest:  “One  might  explain 
imitation  as  a form  of  encoding  that  continues  the  processing  of  information 
that  is  necessary  for  the  representation  of  linguistic  schemas  (both  semantic  and 
syntactic)  in  cognitive  memoiy  [p.  418] .” 

A further  analysis  of  imitation  in  terms  of  cognitive  theory  might  deal  with  the 
manner  in  which  iirsory  information  from  the  person  or  utterance  being 
imitated  is  transfer  ms  J into  memory  templates  and,  conversely,  how  memory 
templates  for  phonetic  material  are  manifested  in  motor  performance  (Posner  & 
Keele,  1973,  pp.  824-825).  Temporal  parameters  may  be  important  in  imita- 
tion. The  Gardners  (1971)  speculated  that  some  of  Washoe’s  learning  resulted 
from  what  they  called  delayed  imitation;  that  is,  Washoe’s  imitations  of  signs 
sometime,  did  not  occur  until  long  after  the  original  observations.  A process  of 
delayed  imitation  might  account  for  the  observation  that  children  sometimes 
come  out  with  a ne.'  word  or  grammatical  structure  “overnight,”  long  after 
original  exposure  to  models. 

Britton  (1970)  remarks  that  “It  would  seem  to  be  nearer  the  truth  to  say  that 
[children]  imitate  people’s  method  of  going  about  saying  things  than  that  they 
imitate  the  things  said  [p.  42] .”  Such  an  imitative  process  would  account  for 
improvisations  like  “I’m  spoonfulling  it  in”  or  “I’m  jumper  than  you  are,”  all 
based  on  the  imitation  of  speech  patterns  that  the  child  observes. 
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If  we  regard  language  production  as  a process  of  converting  intentions  into 
speech,  it  often  presents  features  of  problem  solving:  the  child  uses  whatever 
methods  he  may  have  acquired  that  seem  reasonable  in  this  kind  of  problem 
solving.  Promoting  language  skills  might  entail  teacldng  children  useful  methods 
for  expressing  their  intentions. 


SOME  BRIEF  BUT  DIFFICULT  PRESCRIPTIONS 

The  instructional  prescriptions  I have  to  offer  will  sound  rather  similar  to  those 
of  the  behavioral  technologists,  but  I hope  the  reader  will  appreciate  the  subtle 
but  essential  difference  in  theoretical  outlook. 

like  the  behavioral  technologists,  I recommend  careful  analysis  of  what  is  to 
be  leamed-usually,  analysis  into  rather  small  units,  hut  also  analysis  in  terms  of 
whatever  larger  structures  may  seeir  relevant.  The  analysis,  he 'ever,  is  to  be 
made  in  terms  of  information,  and  only  secondarily  in  terms  of  overt  responses 
to  be  made  on  th<  basis  of  that  information.  In  the  case  of  language  skills,  the 
analysis  of  informition  to  be  learned  will  have  much  to  say  about  the  stimulus 
conditions  that  co/respond  to  meanings  and  communicative  intentions,  and  the 
linguistic  constraints  whereby  those  meanings  and  communicative  intentions  are 
manifested  in  overt  behavior  in  a particular  language  system. 

In  the  preparation  of  instructional  materials  and  procedures,  careful  attention 
is  to  be  paid  to  the  manner  in  which  the  relevant  information  is  presented  to  the 
learner.  Account  must  be  taken  of  what  prior  information  can  reliably  be 
presumed  to  be  available  to  the  learner  at  any  given  point.  There  must  be  great 
concern  with  exactly  what  new  information  is  presented  from  moment  to 
moment  in  the  instruction,  with  reference  to  what  processing  of  that  informa- 
tion is  likely  to  be  performed  by  the  learner.  This  information  processing  should 
be  of  a nature  desired  by  the  instructor. 

In  the  actual  process  of  teaching,  the  learner  should  be  prepared  for  what  he  is 
learning  by  evoking  appropriate  sets  and  expectancies  that  will  direct  and  focus 
his  or  her  attention  on  particular  units  of  information.  Information  about  the 
manner  in  which  new  information  fits  in  with  the  overall  structure  of  what  is  to 
be  learned,  and  its  relevance  to  more  general  goals  of  the  learner,  would  be 
incorpoiated  in  the  learning  situation.  Instead  of  speaking  of  reinforcement,  we 
should  speak  of  the  role  that  certain  types  of  information  can  serve  in  directing 
the  cognitive  processes  of  the  learner  [support  for  this  type  of  prescription  can 
be  found,  for  example,  in  recent  papers  by  Bindra  (1974)  and  Boneau  (1974)] . 

The  planning  of  instructional  sequences  over  stretches  of  time  requires  consid- 
eration of  the  “cognitive  history”  of  new  information  in  terms  of  itc  probable 
course  through  various  memory  systems.  Although  as  yet  we  know  little  about 
the  properties  of  memory  systems,  an  idea!  cognitive  history  of  any  element  of 
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information  to  be  learned  might  be  something  like  this:  In  a first  phase, 
presentations  would  concentrate  on  obtaining  increased  clarity  and  definition  of 
the  learner’s  perceptions  of  stimulus  materials,  leading  to  a point  when  tne 
thing-to-be-leamed  receives  the  greatest  possible  attention  from  an  “executi’"” 
element.  In  a second  phase,  the  information  is  processed  through  short-term 
memory  and  eventually  into  long-term  memory,  passing  into  a state  where  it  no 
longer  needs  to  be  dealt  with  by  an  executive  in  the  focus  of  attention;  it 
becomes,  however,  more  and  more  readily  accessible  from  long-term  memory 
and  thus  acquires  a characteristic  of  automaticity. 

Throughout  this  discussion,  it  is  assumed  that  account  will  be  taken  of 
individual  differences  in  learners.  In  a recent  paper  (Carroll,  in  press)  I have 
suggested  that  individual  differences  in  the  performance  of  cognitive  tasks  are 
reflections  of  parameters  of  memory  stores  and  of  the  production  systems  that 
control  the  flow  of  information  in  a total  memory  model. 

I have  discussed  three  types  of  theory  that  might  apply  to  the  promotion  of 
language  skills.  Obviously,  I favor  cognitive  learning  theory.  I fear,  however,  that 
niy  formulations  will  remain  fanciful  until  they  prove  productive  of  improved 
instructional  outcomes. 


2 

Sources  of  Dependency 
in  Cognitive  Processes 

Robert  C.  Cal  fee 

Stanford  University 


For  some  years  I have  worked  at  untangling  and  measuring  independent  cogni- 
tive skills  in  beginning  reading  (Calfee,  in  press).  The  goal  of  this  work  was 
threefold:  (a)  to  create  a theoretical  model  (or  models)  to  describe  the  process 
by  which  the  ability  to  read  is  acquired,  ( b ) to  use  this  model  to  develop  a 
system  of  assessment  instruments,  each  providing  independent,  unique  sources 
of  information  to  the  classroom  teacher  and  other  individuals  responsible  for 
evaluation  of  a beginning  reader,  and  (c)  to  establish  the  feasibility  of  indepen- 
dent instructional  modules.  Given  solid  evidence  for  independent  stages  in  the 
acquisition  of  reading,  then  perhaps  these  can  be  handled  instructionally  as 
separate  matters,  contrary  to  the  current  practice  cf  trying  to  handle  everything 
at  once. 

Briefly,  my  previous  efforts  focussed  on  the  development  of  “clean”  tests,  in 
which  there  was  some  assurance  that  ancillary  tas  < requirements  (understanding 
instructions,  familiarity  with  materials)  were  elim  mated  as  differential  sources  of 
variability  between  children.  Multiple  regression  served  as  an  analytic  tool  for 
determining  the  independent  contribution  of  various  pmeursor  tests  for  predict- 
ing criterion  performance  in  reading  achievement  (Calfee,  1972;  Calfee,  Chap- 
man, & Venezky,  1972). 

More  recently  I have  been  thinking  about  the  general  question  of  what  is 
meant  by  independence  of  cognitive  processes,  and  the  related  question  of  how 
we  might  test  various  sources  of  independence.  Sternberg  (1969)  was  the  first  to 
point  out  the  central  importance  of  independence  to  information-processing 
models.  His  presentation  was  quite  clear  and  has  served  as  a basis  for  a great  deal 
of  fruitful  research  on  cognition.  However,  I now  realize  that  several  different 
interpretations  of  independence  have  been  intertwined  in  my  thinking  and,  1 
suspect,  in  the  thinking  of  other  investigators  as  well. 
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In  this  chapter,  I will  first  presmt  a generalization  of  Sternberg’s  additive- 
factor  paradigm  for  testing  stage  independence.  Next,  I will  turn  to  the  question 
of  how  to  evaluate  individual  differences  in  an  independent-process  analysis,  and 
wfil  present  a unified  framework  for  testing  different  classes  of  hypotheses  about 
the  independence  of  cognitive  processes.  An  illustration  of  these  techniques  will 
then  be  discussed.  Finally,  some  implications  of  this  work  for  test  design  will  be 
pointed  out. 


This  chapter  deals  with  assessment  of  instruction,  rather  than  with  methods  of 
instruction,  and  hence  is  most  readily  applicable  to  test  design  and  interpretation 
of  test  data.  But  assessment  is  intimately  interwoven  with  the  development  of 
substantive  theories  of  instruction.  A process-oriented  assessment  system  should 
help  us  understand  how  a student  thinks  when  he  is  learning  something.  This 
allows  us  to  formulate  reasonable  hypotheses  about  the  character  of  efficient 
instructional  strategies,  and  to  evaluate  the  effects  of  variation  in  instructional 


STERNBERG'S  ADDITiVE-FACTOR  PARADIGM 

It  has  been  the  fashion  for  the  past  several  years  among  cognitive  psychologists 
to  represent  theoretical  ideas  in  the  form  of  flow  charts  or  block  diagrams 
Sternberg  (1969)  pointed  out  that,  if  this  activity  was  to  be  taken  seriously  it 
was  necessary  to  demonstrate  the  functional  independence  of  the  processes 
represented  by  difrerent  blocks  in  the  system.  He  presented  a methodology  for 

owing  process  independence  for  the  case  of  a single  additive  measure,  reaction 
time. 

The  first  step  in  this  paradigm  is  the  analysis  of  the  underlying  cognitive 
operations  required  to  perform  a task.  This  provides  a rudimentary  information- 
processing model.  The  next  step  is  to  identify  one  or  more  factors  uniquely 
associated  with  each  operation.  Then  a procedure  is  developed  in  which  it  seems 
reasonable  to  suppose  that  the  operations  are  carried  out  as  a series  of  stages, 
one  following  the  other.  The  total  time  to  perform  the  task  is  the  sum  of  the 
times  taken  by  each  stage. 

For  example,  consider  a task  in  which  a subject  is  asked  to  read  a list  of  words, 
and  to  memorize  them  so  that  he  can  recall  them  after  a delay  interval.  The  list 
is  long,  and  during  the  delay  interval  the  subject  is  distracted  in  some  way,  so  the 
task  requires  more  than  short-term  memory.  The  subject  can  study  the  list  for  as 
long  as  he  wishes;  the  study  time  is  the  primary  dependent  measure. 

The  first  step  is  to  specify  the  mental  operations  required  to  perform  the  tasl.. 
The  model  in  Fig.  1 appears  reasonable  for  this  situation.  The  subject  uses  some 
time  to  read  each  word  in  the  list  and  some  more  time  adding  the  word  to  an 
organized  semantic  structure  which  aids  later  recall. 
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FIG.  1 Relation  of  factors  to  pi  "> cesses  in  "study”  model. 


The  next  step  is  to  identify  one  or  more  factors  that  should  have  a unique 
effect  on  a given  stage.  In  Fig.  1,  one  such  factor  is  suggested  for  each  stage.  We 
then  construct  a factorial  design  around  thes-,  variables;  a subject  is  given  word 
lists  containing  familiar  or  unfamiliar  word'  which  are  either  easy  or  difficult  to 
categorize. 

If  the  processes  in  Fig.  1 are  sequentially  independent,  and  if  the  assignment  ot 
factors  to  processes  is  appropriate,  then  a rigorous  test  of  the  model  is  possible, 
there  should  be  no  interaction  between  factors  associated  with  different  stages. 

This  conclusion  is  reached  as  follows.  Assume  it  takes  f seconds  to  read  a list 
of  familiar  words,  and  u seconds  to  read  a list  of  unfamiliar  words,  and  that/< 
u.  Similarly,  the  time,  e,  to  organize  an  easily  categorizable  list  is  assumed  to  be 
less  than  the  time,  d,  for  a list  that  is  difficult  to  categorize.  Then  the 
independent-process  hypothesis  predicts  that  for  each  type  of  list  specified  by 
the  design  factors,  study  time  should  be  the  sum  of  the  component  times.  The 
prediction  is  shown  in  Fig.  2 algebraically  and  graphically. 

An  observable  feature  of  this  prediction  is  that  the  data  should  trace  out 
parallel  functions.  The  effect  of  the  categorizability  factoi  should  be  the  same  at 
both  levels  of  the  familiarity  factor.  Any  other  result-any  deviation  from 
parallel  functions-is  evidence  of  an  interaction,  •,  dtich  would  mean  that  the 
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FIG.  2 Algebraic  and  graphical  predictions  based  on  “study"  model  with  two  independent 
processes. 
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theoretical  analysis  is  faulty  at  some  point.  In  this  event,  general  statements 
about  thn  effects  of  either  factor  are  impossible,  since  the  effects  of  one  factor 
vary  fro:  n level  to  level  of  the  other  factor. 


A GENERALIZATION  OF  THE  ADDITIVE-FACTOR  PARAD’GM 


A representative  of  a generalized  process  model  is  shown  in  Fig.  3.  Processes  A, 
B,  and  C are  assumed  to  be  cognitive  operations  underlying  the  performance  of 
some  task.  To  establish  the  independence  of  these  processes,  it  is  necessary  to 
associate  with  each  process  a factor  set  and  a measure  set.  A factor  set  consists 
of  one  or  more  independent  variables,  variation  in  which  is  presumed  to 
influence  the  corresponding  process  and  that  process  only.  A measure  set 
consists  of  one  or  more  dependent  variables,  each  of  which  reflects  the  operation 
of  the  corresponding  process  and  that  process  only.  In  other  w ords,  for  a process 
model  to  serve  any  useful  purpose  theoretically  or  practically,  we  ought  to  be 
able  to  specify  the  input-output  features  of  each  process  - what  sorts  of  variables 
affect  the  processes,  and  how  can  its  operation  be  measured?  If  every  factor 
interacts  with  every  other  factor,  and  if  we  have  no  clear-cut  way  of  measuring 
the  underlying  processes,  so  that  every  measure  correlates  with  every  other 
measure,  we  have  gained  little  understanding  no  matter  how  elaborate  our  flow 

^How  i:  a model  like  that  in  Fig.  3 to  be  tested?  It  requires  a multifactor 
experiment  with  multivariate  measures,  in  which  each  subject  is  tested  under  a 
variety  of  combinations  of  factors  from  each  of  the  factor  sets,  and  a variety  of 
measures  taken  under  each  combination  to  provide  links  vo  each  component 
process.2 

Throughout  this  chapter  we  consider  only  designs  with  two  processes,  two 
two-lever  factors  in  each  set,  and  a single  measure  for  each  process.  This  implies 
a 24  design,  in  which  each  subject  is  tested  16  times,  once  on  each  ofthe 
factorial  combinations.  Two  measures  are  taken  under  each  combination.  Only 
main  effects  and  two-way  interactions  are  discussed  since  these  suffice  to  test 
the  model  and  to  describe  fully  the  operation  of  each  process. 


•The  term,  process,  is  used  extensively  and  more  or  less  uncritically  throughout  this 
paper  to  refer  to  a mental  operation  of  some  kind.  Stage  has  been  avoided  because  of  the 
possible  confusion  with  developmental  stages.  Process  independence  is  a property  of  a 
particular  task  for  subjects  of  a given  sort.  There  is  no  effort  to  deal  with  the  question  of 
whether  “independent  processes”  might  be  structural  in  nature,  the  result  of  learning,  or 
situation  specific.  Finally,  process  independence  does  not  imply  instructional  independence, 
although  as  suggested  earlier  this  is  a possibility  worth  pursuing. 

J The  present  proposal  is  intended  only  as  a generalization  of  Sternberg  s ideas,  not  a 
replacement,  in  particular,  single-measure  analysis  remains  an  important  technique  for 
investigation  of  process  independence.  This  includes  additive  measures  like  reaction i hme, 
but  might  be  usefully  extended  to  multiplication  measures  like  proportions  (Calfee,  197U). 
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FIG.  3 A generalization  of  the  independent-process  model.  Associated  with  each  compo- 
nent process  is  a set  of  factors  and  a set  of  measures,  each  assumed  to  be  uniquely  linked  to 
the  process. 

The  details  of  the  design  are  spelled  out  in  Fig.  4.  The  sixteen  cells  are  labeled 
according  to  the  four  factors,  two  in  each  factor  set.  Belc  w the  design  are 
contrast  coefficients  for  the  computation  of  the  linear  contrast  for  that  source. 
These  will  be  discussed  shortly.  Below  that  the  data  are  represented  hi  a general 
way.  The  indices  / to  / serve  as  usual  to  denote  levels  of  the  factors  A through  Bl . 
The  m and  n indices  denote  a particular  measure  ( a or  b corresponding  to 
process  A or  B,  respectively)  and  subject. 

In  Fig.  5 is  the  general  linear  factorial  model  for  the  design.  Each  observation 
is  fully  accounted  for  by  this  set  of  parameters.  The  methods  of  estimating  the 
parameters  is  well  known,  and  will  not  be  dealt  with  here  in  any  detail. 

There  are  several  ways  to  carry  out  an  analysis  of  variance  for  the  data  set  in 
Fig.  4 based  on  the  model  in  Fig.  5.  The  most  convenient  method  for  present 
rvirposes  is  based  on  linear  contrasts  (Dayton,  1970,  pp.  37-48,  78-81,  256— 
268).  It  is  possible  to  express  the  magnitude  of  each  source  as  a one-degree-of- 
freedom  linear  contrast  on  the  data.  For  instance,  the  A source  (the  main  effect 
of  the  A factor)  is  the  difference  between  the  A i and  A3  scores;  the  A'  source  is 
the  difference  between  A,'  and  Aj',  and  so  on.  These  contrasts  are  repre- 
sented by  the  corresponding  sets  of  coefficients,  c,  in  Fig.  4.  In  a factorial 
design,  the  two-way  interaction  between  factors  is  the  crossproduct  of  the  two 
sets  of  coefficients.  The  coefficients  for  the  AA'  interaction  source  in  Fig.  4 
were  generated  in  this  fashion . 

The  contrast  coefficients  are  used  to  calculate  a set  of  orthogonal  parameter 
estimates  from  each  subject’s  raw  data.  The  magnitude  of  source  S for  a given 
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FIG.  5 The  general  linear  factorial  model  for  the  design  in  Fig  4. 


measure,  m,  and  a particular  subject,  n,  is  computed  from  the  contrast  coeffi- 
cients for  that  source  and  the  set  of  observations  on  the  given  measure  for  that 
subject: 


For  2P  designs  like  the  one  under  discussion,  the  numerator  of  each  contrast  is 
a simple  difference  score;  the  denominator  normalizes  the  expression  so  that 
regardless  of  the  choi^i  of  coefficients  the  variance  of  the  contrast  is  equal  to 
the  population  variance  under  the  null  hypothesis. 

There  is  a direct  correspondence  between  the  variance  estimate  of  a source  by 
means  of  a linear  contrast  and  the  parameters  from  the  linear  model  (Fig.  5),  the 
latter  serving  often  to  teach  analysis  of  variance  in  statistics  courses.  The 
variance  estimate  for  source  5 over  subjects,  MS  ( S ),  is  based  on  the  average  of 
the  corresponding  contrasts,  C3m..  This  average,  squared  and  multiplied  by  the 
number  of  subjects,  is  equal  to  the  MS(S).  If  the  null  hypothesis  holds  for  source 
S then  MS(S)  is  an  estimate  of  the  population  variance.  The  residual  variance  in 
the  contrast  scores  for  source  5 provides  a second  estimate  of  the 
population  variance.  The  two  variance  estimates  generate  an  F ratio  to  test  the 
plausibility  of  the  null  hypothesis. 

The  point  to  emphasize  here  is  that  the  linear  contrast  provides  a convenient 
method  for  representing  each  independent  parameter  estimate  in  the  general 
linear  factorial  model  in  Fig.  5.  The  procedure,  in  its  essentials,  is  to  compute 
each  estimate  in  the  form  of  a mnnelized  difference  score  for  each  subject.  The 
analysis  of  variance  becomes,  to  a-'  tents  and  purposes,  an  orthogonal  collec- 
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don  of  t tests  on  difference  scores.  This  method  is  algebraically  equivalent  to  a 
conventional  repeated-measures  analysis  of  va.iance.  1 am  overlooking  the  use  of 
multivariate  analysis  of  variance  as  an  alternative  method  of  analysis,  as  well  as 
questions  about  the  dangers  of  relying  on  the  acceptance  of  the  nuil  hypothesis 
as  a way  of  supporting  a substantive  hypothesis.  These  are  matters  of  some 
concern,  but  they  have  been  discussed  elsewhere  and  are  not  central  to  the 
problem. 


Process  Independence— On  the  Average 


The  major  prediction  of  an  independent-process  model  for  data  like  that  de- 
scribed in  Fig.  4 is  straightforward.  The  factors)  associated  with  a given  process 
can  affect  only  the  measure(s)  associated  with  the  process.  No  other  sources  of 
variance  should  be  substantial.  The  details  are  indicated  to  the  right  of  Fig.  5. 
Variation  in  Factor  A or  A'  or  the  interaction  AA'  might  be  expected  to 
substantially  (and  significantly)  affect  measure  a\  these  sources  should  not  have 
any  noticeable  effect  on  measure  b.  A similar  state  of  affairs  holds  for  factors  B 
and  B'  with  regard  to  measure  b.  Any  interaction  between  the  two  factor  sets  is 
evidence  against  the  independence  of  the  processes,  no  matter  which  measure  is 
affected. 

A concrete  example  may  be  useful  at  this  point.  This  study  (after  Floyd, 
1972)  is  designed  to  investigate  the  processes  by  which  young  children  read 
single  words  presented  in  isolation. 

The  model  for  this  task  is  shown  in  Fig.  6.  Two  processes  are  proposed: 
decoding  and  semantic  matching.  Reading  is  conceived  as  an  initial  translation  of 
the  printed  word  into  an  auditory  form,  then  a search  in  memory  for  a lexical 
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FIG.  6 Example  of  a two-process  model  for  reading  a word  in  isolation.  (After  Ei^d, 
1972.) 
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match  For  each  process,  a factor  set  and  a measure  set  are  suggested.  The 
sTmu'us  Ids  comprise  factorial  combinations  of  all  the  factors  shown^Th 
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measure  of  the  magnitude  of  this  source  over  subjects.  The  residual  variance 
between  subjects  in  the  contrast  yields  a measure  of  error  variance  or  a 
statistical  significance.  The  ratio 

MS( vowel  vnmpievity  for  pronunciation) — _ 

F ~MS(subjects  by  vowel  complexity  for  pronunciation) 

if  statistically  significant,  would  fit  the  hypothesis  of  process  independence.  The 
ratio 
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and  useful  way  of  understanding  how  a subject  performs  the  task. 

Process  Independence-Individual  Differences 

The  orecedine  test  of  process  independence  involved  comparison  with  an  error 

^ Estimate  based  on  individual  differences  in  subjects  performance  on  a 
vanance  esfmate  based  customary  research  on  cognition  to  treat 

particular  contrast.  While  it  is  cu  o ly  & Lunneborg,  1973,  and  Carroll, 
individual  differences  as  err  ( ’ 19’69  307-308),  this  is  not  an 

1976  are  exceptions'  also,  cf.  Sternberg,  vv 

adequate  ucatment  for  educational  research  and  pracUce. 
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Several  distinctive  sources  of  individual  differences  are  represented  in  the 
general  linear  factorial  model  (Fig.  5).  Half-way  down  the  list  are  parameters, 
vm  „ which  measure  subject  n's  general  performance  level  for  measure  m, 
averaged  over  all  factorial  combinations.  Below  that  are  subject-factor  param- 
eters for  factor  set  A,  (at,)/,m,n<  etc->  next  are  corresponding 

parameters  for  factor  set  B,  and  nally  the  parameters  for  the  interactions  of 
these  sets.  As  indicated  in  the  figure,  if  the  independent-process  hypothesis  is 
correct,  only  certain  of  these  parameters  should  produce  substantial  variance 
estimates. 

For  instance,  suppose  that  for  certain  subjects  factor  A had  a large  effect  on 
measure  b,  contrary  to  the  process  independence  hypothesis,  whereas  for  other 
subjects  this  effect  was  negligible.  Then  MS( subjects  by  A for  measure  b)  would 
be  relatively  large.  MS(A  for  measure  b),  which  represents  the  general  effect  of 
the  A factor  on  measure  b,  might  be  nonsignificant  when  compared  to  ^(sub- 
jects by  A for  measure  b).  Acceptance  of  the  null  hypothesis  might  be  taken  as 
evidence  in  support  of  general  process  independence-an  erroneous  conclusion, 
at  least  for  some  subjects. 

Large  variation  between  subjects  in  the  parameters  for  a given  source  may 
compromise  the  interpretation  of  the  overall  variance  source.  The  most  obvious 
danger  is  that  an  unduly  large  error  variance  estimate  may  obscure  evidence 
contrary  to  the  independent-process  hypothesis.  In  this  regard,  comparison  of 
variance  components  provides  a useful  supplement  to  significance  tests  in  the 
examination  of  data. 

Under  certain  conditions  it  is  possible  to  test  the  hypothesis  that  a subject- 
factor  variance  estimate  is  larger  than  expected.  The  design  must  permit  the 
estimation  of  a residual  variance  term;  replication  within  subjects  or  pooling  of 
high-order  interactions  often  serves  this  purpose.  The  test  compares  each  subject- 
factor  source  with  the  residual  error  variance.  For  example,  if 


F- 


MS(sub|ects  by  vowel  complexity  for  pronunciation) 
MS(residual  error) 


is  a significant  source  of  variance,  this  is  compatible  with  the  independence 
hypothesis.  On  the  other  hand,  the  finding  that 


F = 


MSfsubjects  by  word  familiarity  for  pronunciation) 
MS( residual  error) 


is  highly  significant  constitutes  evidence  contrary  to  the  hypothesis.  Such  tests 
are  quite  sensitive  because  of  the  large  number  of  degrees  of  freedom  for  each 
variance  estimate. 

Subject-factor  sources  may  provide  the  strongest  evidence  for  or  against 
process  independence.  If  MS( subjects  by  vowel  complexity  for  pronunciation)  is 
large,  then  MS(vowel  complexity  for  pronunciation)  will  seem  relatively  small, 
and  may  be  insignificant.  Such  a result  does  not  mean  that  vowel  complexity  has 
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no  effect  on  the  decoding  process,  but  rather  that  the  magnitude  of  that  effect 
varies  widely  from  subject  to  subject,  in  ways  that  are  not  controlled  by  the 
between-subject  design.  Similarly,  Af5(won;  familiarity  by  pronunciation)  might 
be  insignificant  when  tested  against  A/S(subjccts  by  word  familiarity  for  pronun- 
ciation). But  if  the  latter  variance  i:  'arge  relative  to  AfS(residual  error),  this  is 
evidencj  contrary  to  process  independence,  ju.it  as  surely  as  a large  average  effect 
is  contrary  evidence. 

Modification  of  a Model 

The  preceding  discussion  of  statistical  “tests”  may  imply  a destructive  approach, 
in  which  a model  is  proposed  and  then  all  efforts  are  directed  toward  question- 
ing its  adequacy.  In  fact,  the  factor-process  measure  approach  is  self-correcting 
in  the  development  of  a model.  Examination  of  a series  of  experiments  provides 
positive  information  about  the  character  of  underlying  processes,  the  specifica- 
tion of  useful  correspondences  between  factors  and  processes,  and  the  descrip- 
tion of  factors  and  measures  in  a precise,  unconfounded  manner.  The  results  of 
each  experiment  lead  to  “perfecting”  modifications  in  the  basic  model,  which 
can  be  subjected  to  further  test. 

Parameter  Independence 

To  this  point,  independence  has  referred  to  the  absence  of  interactions  between 
factors  associated  with  different  processes.  Closer  examination  of  the  question 
of  individual  differences  reveals  the  existence  of  another  type  of  independence, 
namely,  the  extent  to  which  the  parameters  of  the  general  linear  factorial  model 
are  correlated.  This  property  of  a data  set  will  be  called  parameter  indepen- 
dence. 3 

The  idea  of  looking  at  the  between-subjects  correlation  between  a pair  of 
analysis-of-variance  parameters  is  somewhat  unconventional,  but  this  appears  to 
be  a reasonable  question  to  raise  of  a data  set.  Consider  the  linear  contrasts 
CA  an  and  CA\a,n  -these  are  difference  scores  for  the  A and  A'  factors  for 
measure  a,  calculated  for  each  subject  n.  Imagine  that  these  pairs  of  scores  are 
arranged  in  a scatterg.am.  The  previous  analyses  have  dealt  with  the  marginal 
distributions,  asking  whether  the  marginal  means  are  zero,  and  whether  the 
variance  around  each  marginal  mean  is  comparable  in  magnitude  to  an  estimate 
of  the  population  variance. 

The  size  of  the  correlation  between  CA  at„  and  CA'iaMi  is  therefore  a new 
question,  and  statistically  independent  of  the  previous  questions  asked  of  the 

’To  the  best  of  my  knowledge,  examination  of  the  specifics  of  a variance-covariance 
matrix  along  the  tines  suggested  below  has  not  been  suggested  before.  Test  for  homogeneity 
of  the  matrix  is  a crude  effort  at  best.  Multivariate  analysis  of  variance  is  mainly  concerned 
with  appropriate  statistical  inference  when  the  dependent  variable  is  a vector. 
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data  A large  correlation  would  mean  that  subjects  who  are  strongly  affected  by 
variation  in  factor  A are  also  strongly  affected  by  factor  A , and  contrariwise.  If 
the  correlation  is  negligible,  then  the  effects  of  the  two  factors  are  independent 
of  each  other,  in  the  sense  that  the  knowledge  that  a subject  is  strongly  affected 
by  variation  in  one  factor  says  nothing  about  his  or  her  reaction  to  another 

factor. 

In  the  preceding  example,  the  contrasts  CAa,n  and  CU’.o.n  estimate  param- 
eters for  factors  A and  A'  which  linked  to  the  same  process.  This  analysis  will  be 
referred  to  as  a test  of  intraprocess  parameter  independence.  One  can  look  at 
correlations  with  interaction  contrasts  as  well  as  comparing  main  effect  con- 
trasts. For  instance,  the  correlation  between  CAt0i„  and  CAA\a,n  asks  whether 
the  size  of  the  effects  of  variation  due  to  factor  A are  correlated  with  the 
magnitude  of  differential  effects  of  A at  the  two  levels  of  A It  should  be 
stressed  that  the  correlation  between  contrasts  is  not  the  same  as  the  interaction 
between  factors.  Moreover,  one  may  examine  these  correlations  regardless  of  the 
outcome  of  the  analyses  of  the  marginal  distributions. 

It  is  also  possible  to  examine  the  between-subject  correlation  of  contrasts  for 
sources  from  two  different  processes:  CA,a,n  and  cB.b,n,  for  example.  It  is 
consistent  with  the  process-independence  hypothesis  that  both  of  these  sets  ot 
contrasts  could  be  significant  sources  of  variance,  either  on  the  average  or  as 
subject-factor  interactions;  factor  A is  linked  with  measure  a,  and  factor  B with 
measure  b.  The  magnitude  of  the  correlation  between  contrasts  is  a separate 
questio*  and  has  no  bearing  on  process  independence.  If  two  such  contrasts  are 
highly  . -elated,  it  means  that  a subject  who  is  strongly  affected  by  a factor  in 
one  process  is  likely  to  be  strongly  affected  by  another  factor  linked  to  a secon 
process,  whereas  a subject  showing  little  effect  of  one  factor  would  not  be  much 
affected  by  variation  in  the  other  factor.  This  will  be  called  an  analysis  of 
interprocess  independence. 


General  Parameter  Independence 

There  are  two  other  types  of  independence  to  be  considered  in  examining 
individual  differences.  These  are  measured  by  the  correlation  (a)  between  general 
parameters,  each  based  on  the  average  for  a given  measure  over  factorial 
conditions  for  an  individual  subject,  and  (b)  between  general  parameters  and 
specific  contrast  parameters.  If  you  refer  to  the  general  Unear  factorial  model 
(Fig.  5),  the  first  correlation  is  between  the  estimates  of  va  n and  vb  „.  This  is 
the  correlation  between  the  average  scores  for  different  dependent  variables, 
which  is  frequently  calculated  by  researchers.  I wiU  refer  to  this  as  independence 
of  general  parameters. 

The  correlation  between  general  and  specific  parameters  has  been  examined 
less  often.  It  conasts  of  the  comparison  of  terms  like  va<n  a--d  Actually 


2.  COGNITIVE  DEPENDENCY  35 


for  two-level  factors  the  correlation  of  ua  n and  q(j„  is  equivalent  to  the 
preceding  correlation,  and  perhaps  is  a bit  easier  to  grasp.  The  question  raised 
here  is  whether  subjects  who  do  better  on  the  average  lifer  all  conditions  also 
tend  to  be  more  strongly  affected  by  factor  variations.  Fo.  . .ample,  the  student 
who  pronounces  words  quite  well  on  the  average  is  strongly  affected  by  variation 
in  vowel  complexity,  whereas  the  student  whose  pronunciation  is  generally  poor 
does  about  the  same  whether  the  words  contain  simple  or  complex  vowels.  1 will 
refer  to  this  analysis  as  a test  of  the  independence  of  general  and  specific 
parameters. 

Sources  of  Dependency:  An  Overview 

It  should  be  emphasized  that  the  different  types  of  independence  described 
above  are  statistically  separate,  and  that  the  answer  to  one  question  does  not 
directly  determine  answers  to  any  other.  That  is,  one  can  usefully  inquire  about 
each  of  the  following  substantive  questions: 

1 . Process  independence,  average  over  subjects: 

Are  any  between-process  sources  of  variance  so  large,  on  the  average,  that 
the  hypothesis  of  process  independence  is  untenable? 

2.  Process  independence,  subject-factor  interactions: 

Are  any  between-process  subject-factor  interactions  so  large  that  the 
hypothesis  of  process  independence  is  untenable? 

3.  Intraprocess  parameter  independence: 

Are  effects  of  within-process  factors  correlated? 

4.  Interprocess  parameter  independence: 

Are  effects  of  between-process  factors  correlated? 

5.  General  parameter  independence: 

Are  total  scores  for  different  measures  correlated? 

6.  General-specific  parameter  independence: 

Are  the  specific  effects  of  process  factors  correlated  with  generalized 
performance  as  measured  by  total  scores? 

The  answers  to  these  different  questions  carry  different  implications.  Ques- 
tions (1)  and  (2)  bear  on  the  adequacy  of  a proposed  information -processing 
model.  Questions  (3)  and  (4)  have  to  do  with  the  degree  to  which  individual 
subjects  are  more  or  less  generally  labile  in  reaction  to  factor  variation.  Ques- 
tions (5)  and  (6)  deal  with  the  relation  of  general  performance  and  process- 
linked  shifts  in  performance. 

If  process  independence,  (1)  and  (2),  is  supported  by  the  data,  this  is  evidence 
that  assessment  (and  pjssibly  instruction)  may  proceed  by  investigation  of  each 
process  as  a separable  entity.  For  instance,  suppose  decoding  and  semantic 
matching  operated  as  independent  processes  in  a series  of  experiments.  Then  it 
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might  be  reasonable  to  design  assessment  and  instructional  programs  that 
focussed  s.  icifically  on  decoding  skills,  with  minimal  concern  about  the  corre- 
sponding comprehension  processes,  and  vice-versa. 

If  answers  to  Questions  (3)  through  (6)  reveal  frequent  and  marked  depen- 
dencies, this  supports  a “G  factor”  interpretation  of  individual  differences  in 
cognitive  processes  for  the  task.  If  strong  correlations  hold  between  measures  in 
different  tasks,  there  is  little  need  for  extensive  assessment  of  an  Individual 
student.  Administration  of  a few  “subtests”  will  indicate  the  student’s  general 
level  of  performance,  or  his  reaction  to  factor  variation,  or  both.  From  this  we 
can  predict  his  performance  under  other  conditions.  On  the  other  hand,  if 
dependencies  are  negligible,  the  development  of  comprehensive  assessment  sys- 
tems becomes  a worthwhile  endeavor. 


AN  EXPERIMENT  ON  LINE  DRAWING 


1 


X 


I 


Here  is  an  illustration  of  how  to  apply  these  techniques  to  a data  set.  The  study 
was  not  designed  to  test  an  independent-process  model,  and  it  seems  unlikely 
that  the  treatment  factors  are  uniquely  linked  to  underlying  processes.  But  the 
within-subjects  portion  of  the  design  raises  interesting  questions,  and  the  data 
were  readily  available  for  the  analysis. 

The  study  was  part  of  an  investigation  of  impulsivity-reflectivity  in  young 
children  (Kagan,  Rosman,  Day,  Albert,  & Phillips,  1964).  Some  children  seem  to 
attack  a problem  impulsively -more  quickly  and  with  a dgher  error  rate.  Others 
tend  to  work  reflectively— more  carefully  and  accurately  There  is  some  evidence 
that  impulsivity  is  correlated  with  poorer  reading  achievement.  Our  particular 
interest  was  in  determining  the  extent  to  which  speed  and  accuracy  measures 
were  affected  by  situational  variables  in  a simple  motor  task. 

The  children  were  shown  a paper  with  half  a dozen  items  like  the  ones  in  Fig. 
7,  and  told  that  their  job  was  to  draw  a line  from  each  rabbit  along  the  “road” 
to  the  carrot  without  touching  the  lines.  Time  to  complete  all  six  items  on  a 
page  was  measured,  as  were  the  total  number  of  line-touching  errors.  The 
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FIG.  7 Example  of  maierials  used  in  line-drawing  lask. 
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FIG.  8 Information-processing  model  for  line-drawing  experiment. 

children,  first-graders  and  kindergartners,  were  tested  twice  with  three  months  or 
so  between  sessions. 

The  two  within-subject  factors  of  primary  interest  are  related  in  Fig.  8 to  a 
tentative  processing  model.  One  process  determines  rate  of  movement,  and  the 
second  process  determines  accuracy.  Latency  and  errors  seem  natural  measures 
for  these  processes. 

The  Set  factor  describes  the  instructional  conditions  under  which  the  child 
performed  the  task.  The  first  two  pages  were  always  done  with  no  set.  The 
instructions  emphasized  neither  speed  nor  accuracy:  “Draw  a line  down  each 
road  from  each  bunny  to  his  canot.  Try  not  to  touch  the  sides  of  the  road.  If 
you  do  touch  the  side  of  the  road,  it’s  okay,  keep  going,  but  try  not  to  touch  the 
lines.”  On  the  next  two  pages,  accuracy  was  stressed:  “Be  very,  very  careful  not 
to  touch  the  sides  of  the  road.”  Finally  on  the  last  two  pages,  the  child  was 
asked  to  draw  as  fast  as  possible:  “Get  each  bunny  to  his  carrot  as  quickly  as 
possible.  Try  not  to  touch  the  sides  of  the  road,  but  if  you  Jo,  it  doesn’t  matter, 
the  important  thing  is  to  complete  the  page  as  quickly  as  possible.”  Set  and 
order  are  confounded  in  this  design  as  a matter  of  practical  necessity. 

The  Difficulty  factor  denotes  whether  each  page  had  easy  items  (the  lines 
connecting  rabbit  and  carrot  were  5/8  inch  apart)  or  difficalt  items  (the  lines 
were  1/4  inch  apart). 

For  purposes  of  analysis,  the  Set  factor  has  been  identified  with  the  timing 
process,  and  the  Difficulty  factor  with  the  movement  process.  This  linkage  is  not 
really  satisfactory,  as  noted  earlier.  Each  factor  seems  likely  to  affect  both 
processes  as  constituted.  Nonetheless,  let  us  see  what  the  analysis  tells  us  about 
the  data. 

Analysis  of  Process  Independence 

Univariate  analyses  of  variance  were  carried  out  on  the  two  measures  for 
preliminary  statistical  evaluation;  these  are  presented  in  Table  1.  In  Fig.  9 time 
and  error  scores  are  shown  as  a function  of  Grade,  Set,  and  Difficulty.  Grade, 
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TABLE  1 

Analysis  of  Variance  (islacted  lourcet!  of  Line-Drawing  Experiment 


— 

Time  (sec) 

Errors 

Source 

F 

AfS(E) 

F 

AfS(E) 

Grade/ Age  ( G ) 

5.6* 

- 

13.2* 

- 

Ability  (A) 

<1 

- 

<1 

G X A 

~1 

- 

~1 

N(GA) 

- 

1743. 

— 

45.9 

Instructional  set  (/) 

No  set  versus  accuracy  + speed  (/, ) 

44.8** 

247.9 

21.5** 

4.7 

Accuracy  versus  speed  (/, ) 

113.2** 

449.5 

47.7** 

9.6 

Difficulty  (D) 

188.9** 

416.6 

76.0** 

24.9 

Session  (5) 
/,xfl 

35 

677.5 

<1 

12.6 

20.7** 

108.7 

22.1** 

3.9 

1,XD 

64.0** 

78.7 

39.0** 

5.2 

/,XS 

1.7 

479.8 

<1 

3.0 

/,XJ 

3.2 

326.3 

<1 

2.5 

DyS 
GX  I, 

3 7 

275.6 

<1 

9.7 

<1 

247.9 

9.1** 

4.7 

GX/, 

2.5 

449.5 

9.5** 

9.6 

GYD 

<1 

416.6 

13.1** 

24.9 

GYl,YD 

2.7 

108.7 

8.6** 

3.9 

GX/.XD 

Residual 

<1 

78.7 

5.9** 

5.2 

154.0 

- 

2.5 

“df  for  all  tests  are  1 and  36. 

*F(1,  36,  .05)  = 4.12.  **F(  1,  36,  .01)  = 7.40. 


the  two  Set  contrasts  and  Difficulty  are  ali  significant  sources  of  variance  for 
both  measures,  as  are  certain  interactions  among  these  factors.  Instructions  o 
“be  more  accurate”  slow  the  children  down  a little,  without  any  noticeable 
decrease  in  errors.  Instructions  to  "speed  up”  are  obeyed  by  the  children  (and 
happily  so),  but  with  a marked  increase  in  errors.  Difficulty  has  a large  effect  on 
both  time  and  errors.  The  children  take  much  lonr  er  to  connect  the  rabbit  and 
carrot  when  the  fines  are  close  together,  but  they  Jso  make  abater  number  of 
errors  under  this  condition.  The  interaction  Ktween  die  Set  and  ^ feu  y 
factors  can  be  traced  to  the  speed  instruct*  ,is.  The  effect  of  the  Difficulty 
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FIG.  9 Time  and  error  scores  as  a function  of  Grade,  Set,  and  Difficulty,  averaged  over 
sessions.  Easy  items  are  open  circles.  Hard  items  are  filled  circles.  N = 20  in  each  group. 

factor  is  greater  for  errors  and  smaller  for  response  time  under  speed  instructions 
compared  to  the  other  two  instructional  conditions.  The  kindergartners  make 
more  errors  than  first  graders,  especially  in  the  difficult  condition  and  under 
instructions  stressing  speed. 

On  the  average,  then,  the  students  performed  the  line-drawii.  ,k  fairly 
efficiently  without  explicit  instructions  about  how  to  arrangf  trade  off 
between  speed  and  accuracy.  They  worked  about  as  slowly  as  they  felt  they 
could,  and  speeding  up  led  to  an  increase  in  errors. 

Certain  of  the  subject-factor  variance  estimates  are  substantially  (and  signifi- 
cantly) larger  than  the  residual  variam  e based  on  the  highest-order  interaction. 
There  are  large  individual  differences  in  the  time  measure  due  to  variation  in 
accuracy  vs.  speed  and  difficulty,  as  well  as  variation  from  the  first  session  to  the 
second.  Variation  in  difficulty  is  the  largest  source  of  individual  difference  in  the 
error  scores,  followed  by  session  and  accuracy-speed. 

The  process  model  in  Fig.  8 must  be  rejected  on  several  grounds.  Both 
measures  are  strongly  affected  ly  both  factors,  and  the  interaction  between  the 
two  factors  is  significant.  The  large  subject-factor  variance  in  time  due  to 
variation  in  difficulty  is  also  evidence  contrary  to  the  model. 

These  findings  suggest  that  either  (a)  the  two  processes  are  so  complexly 
related  that  little  is  gained  by  postulating  separate  processes,  or  (b)  the  factors 
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TABLE  2 

Correlation  Matrix  of  Time  and  Error  Scores,  Line-Drawing  Experiment 


apirst  graders  above  diagonal,  kindergarteners  below  diagonal.  For  clarity,  d'  'imals 
omitted,  r’s  above  .5  are  in  boldface,  and  r's  below  .2  are  deleted  Variable  name  codes  are 
Ave  (average),  I and  II  (first  and  second  session).  Ns  (No  set),  Acc  (accuracy),  Spd  (Speed), 
E (Easy),  and  D (Difficulty). 


are  poorly  defined  with  reference  to  the  two  processes,  (which  seems  probable), 
or  (c)  the  measures  are  poor  indicators  of  the  underlying  processes. 


Analysis  of  Parameter  Independence 

We  turn  next  to  an  examination  of  intercorrelations  among  the  measures.  Typi- 
cally, an  investigator  might  look  at  correlational  data  like  that  in  Table  2,  or 
some  portion  thereof.  The  2 X 2 matrix  in  the  upper  lefthand  comer  is  a likely 
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TABLE  2 (continued) 


10 


11 


12  13 


14 


15 


16  17 


18  19  20  21 


22 


80 

84 

29 

64 

88 

48 

64 

68 

41 

93 

49 

87 

90 

54 

87 

77 

42 

26 

61 

44 

31 

49 

32 

51 

37 

51 

47 

25 

30 

59 

62 

29 

69 

54 

24 

29 

-26 

56 

60 

30 

761 

46 

77 

48 

- 

~ 

~ 

72 

38 

63 

37 

23 

46 

-26 

_ 76 

46 

70 

69 

^ 36 

67 

27 

26 

30" 

47 

-23 

48 

63 

67" 

-20 

42 

28 

48 

50 

-38 

49 

47 

37 

70 

^ 

20 

-32 

46 

-37 

-15 

38 

-28 

31 

-53 

-35 

38 

-51 

38 

26 

55 

-41 

-22 

42 

{continued) 


candidate  It  gives  the  correlation  between  time  and  error  scores  averaged  for 
Mch'ttudent  ore,  the  entire  repeated-measures  design  structure.  The  correlation 
" neput  in  both  groups,  negligibie  in  the  nre-gr.de  data  but  fiuHy  sreable  .n 
The  kindergarten  data.  It  appears  that  there  is  a tendency  for  children  to  rade 
off  siTd  and  accuracy  on  this  usk;  the  faster  a child  draws,  the  more  bkely  he 

bL“kxVma°trix  just  down  the  diagonal  is  another  reasonable  ™!y^  lt 
sho«  i,.  relation  between  time  and  errors  calculated  for  each  undent  from  the 
are  rage  conditions  in  each  session.  Again  there  is  evidence  of  an  -nverse  relation 
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TASlt-E  2 (continued) 


VARIABLE  23  24  25  26  27  28  29  30 


Time  »ve  both  I 36  SI 

Error*  ive  both  2 -26 

Time  «ve  1 3 44 

Enor  »ve  I 4 -23 

Time  ive  It  5 63  SI 

Error  »ve  II  6 -27 


Time  I N*  E 7 26 

D 8 

Acc  E 9 34 

D 10  48 

SpdE  II  41  61 

D 12  40  73 

Enor  I Ni  E 13 

D 14 

AccE  15 

D 16  21 

SpdE  17  25 

D 18  -34 

Time  II  Ni  E 19  H 3? 

D 20  49  42 

Acc  £ 21  33 

D 22  ^46  29 

Spd  E 23  \\64 

D 24  59"\' 

Error  II  Ni  E 25  ^26  =13 

D 26  -26  -54 

AccE  27  -31 

D 28  -26  -57 

SpdE  :v  -43  -48 

D 7 t -32  -74 


-27 

-24 

-20 

SI 

94 

86 

80 

84 

-29 

-32 

~ 

-25 

37 

87 

74 

74 

72 

61 

93 

0*0 

S 2 

81 

90 

-25 

-29 

-33 

-49 

-32 

-22 

-24 

-27 

-32 

~ JT 

~ 

42 

29 

44 

29 

78 

~ 

S9 

49 

56 

46 

84 

- 

75 

91 

58 

SS 

86 

~ 

79 

93 

65 

25 

72 

~ 

65 

59 

76 

-21 

44 

49 

~ 

39 

25 

30 

-23 

~ 

-22 

~ 

-23 

-27 

~ 

-20 

-34 

. 57 

~ 

SI 

57 

48 

47" 

82 

79 

73 

S3 

74 

.73 

81 

S3 

27 

33 

34" 

vN, 

.58 

45 

79 

26 

79 

48^ 

between  time  and  err  jr  scores.  Performance  is  reasonably  stable  from  one 
session  to  the  next  in  firs;  graders,  and  moderately  so  in  the  kindergartners. 

The  remainder  of  the  matrix  presents  the  entire  repeated-measures  design 
structure,  perhaps  the  most  defensible  way  of  presenting  the  raw  data.  The 
correlations  between  time  and  error  measures  are  blocked  in  to  emphasize  a 
particular  property  of  these  data.  The  several  time  measures  tend  to  be  relatively 
highly  correlated,  as  do  the  error  measures,  compared  to  the  inter-measure 
correlations.  But  the  patterns  are  admittedly  fuzzy.  It  is  the  sort  of  matrix  that 
might  be  subjected  to  factor  analysis  in  order  to  clarify  the  underlying  tf  > 
tures. 

However,  raw  scores  are  not  the  measures  to  examine,  given  the  theoretical 
point  of  view  elaborated  previously.  Each  raw  score  is  a combination  of  factor 


\ 
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effects  (cf.  Fig.  5)  which  may  be  interrelated  in  more  or  lest  complex  fashion. 
Let  us  see  what  the  relations  between  the  parameters  of  this  data  set  look  like. 

To  determine  relations  among  the  basic  parameters,  we  will  u.;e  linear  contrasts 
computed  from  the  raw  scores  for  several  of  the  sources  from  the  analysis  of 
variance  (Table  1).  The  correlation  matrix  displayed  in  Table  3 shows  the 
relations  between  certain  contrasts  along  with  two  average  scores,  the  average 
over  ill  conditions  (All)  and  the  average  over  all  No-Set  conditions  (No  Set). 
Esd,  of  the  entries  in  this  table  stands  for  a parameter  from  the  general  linear 
factorial  hypotheses  for  this  experiment.  For  instance,  Tune  All  (Variable  1 ) is 
the  average  response  time  over  all  the  design  variations  for  a given  subject.  This  is 
equivalent  to  an  estimate  of  sw  ^ for  subject  n.  Time  A vs  S (Variable  4)  is 
the  contrast  in  time  scores  between  the  accuracy  and  speed  conditions,  averted 
over  difficulty  and  sessions.  This  is  related  to  the  estimate  of  the  parameter 


for  the  subject  n. 

Table  3 was  obtained  by  computing  these  parameter  estimates  for  each 
subject,  and  entering  these  values  into  a standard  correlation  program.  Since  a 
great  deal  of  information  is  compressed  in  this  table,  it  may  be  worthwhile  to 
describe  its  organization  in  more  detail.  There  was  reason  to  believe  that  the 
kindergarten  and  first-grade  data  might  show  different  patterns,  and  so  separate 
analyses  were  conducted  at  each  grade  level.  Kindergarten  results  are  below  and 
to  the  left  of  the  main  diagonal,  first-grade  above  and  to  the  right.  Tune  and 
error  scores  are  analyzed  separately.  Along  the  margins  are  the  residual  standard 
deviations  for  each  source  (this  is  the  square  root  of  the  error  mean  square  from 
the  analysis  of  variance  calculated  separately  for  each  grade),  and  the  F ratio  for 
the  source  (again  based  on  separate  analyses  for  each  grade).  The  major  elements 
of  an  analysis  of  variance  can  be  reconstructed  from  these  marginal  entries,  and 
the  relative  magnitvde  of  various  sources  and  of  error  terms  can  be  seen. 

The  off-diagonal  entries  in  Table  3 are,  as  noted  earlier.  Pearson  correlations 
between  the  contrast  scores.  To  give  a concrete  idea  of  what  the  relations  in 
Table  3 mean,  two  scatterplots  are  presented  in  Fig.  10.  The  kindergarten  and 


first-grade  data  have  been  combined  in  these  plots. 

The  correlation  matrix  in  Table  3 has  a reasonably  simple  structure  Certain 
correlations  are  very  large  (positive  or  negative)  and  the  rest  tend  to  be  relatively 
small.  Except  for  the  No-Set  vs.  Accuracy-Speed  contrast,  and  the  interaction  of 
this  contrast  with  Difficulty,  the  correlations  within  the  time  and  error  subma- 
tnces  are  high.  With  few  exceptions,  the  correlations  outside  these  submatnees 


are  small. 

This  pattern,  together  with  an  examination  of  the  scatterplots  for  the  larger 
correlations  (those  in  Fig.  10  are  typical),  shows  that  children  who  either  work 
fast  or  make  loU  of  errors  are  relatively  unaffected  by  variation  in  the  situational 
factors.  Set  and  Difficulty.  Moreover,  students  who  are  strongly  affected  by 
variation  in  one  situational  factor  (Set)  ate  strongly  affected  by  variation  in  the 
other  situational  factor  (Difficulty). 
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AVERAGE  TIME  (SECONOSI 


LITTLE  ACCURACY 

EFFECT  SLOWER 

ACCURACY  y*  SPEED  CONTRAST 

FIG.  10  Sample  scallerplois  for  contrasi  estimates,  line-drawing  study. 

A particulaily  interesting  feature  of  these  data  is  that  the  statements  above 
apply  independently  to  the  two  response  measures.  The  correlation  between 
time  and  error  contrast  scores  is  negligible,  with  a few  exceptions  to  be  discussed 
below.  This  result  suggests  that  the  process  model  in  Fig.  9 might  be  reasonably 
adequate  after  all.  The  Set  nd  Difficulty  factors  do  not  fulfill  the  requirements 
for  testing  an  independem  process  model-they  were  not  selected  to  link 
uniquely  to  the  proposed  operations-but  the  model  may  be  a useful  approxima- 
tion. 

In  any  event,  the  purpose  of  this  exercise  is  not  to  promote  any  substantive 
finding.  It  does  seem  noteworthy  that  the  approach  leads  to  a considerable 
simplification  in  the  data  on  its  maiden  voyage.  The  data  in  Table  3 scarcely 
require  further  clarification.  The  basic  structure  is  immediately  apparent,  time 
and  error  comprise  two  independent  components,  the  constituent  parameters  of 
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which  are  highly  interrelated.  It  is  obvious  when  a constituent  drops  out.  For 
instance,  the  No-Set  vs.  Accuracy-Speed  contrast  in  time  scores  is  a substantial 
and  significant  source  of  variance,  but  unrelated  to  any  other  contrast.  This 
contrast  was  chosen  as  the  orthogonal  complement  to  the  Accuracy  vs.  Speed 
contrast,  but  it  may  not  make  psychological  sense.  One  possibility  is  that  the 
No-Set  scores  might  serve  better  as  a covariate.  However,  as  can  be  seen  in  Table 
3,  these  scores  are  highly  correlated  with  the  overall  average,  and  the  pattern  of 
contrast  relations  with  No-Set  and  All  scores  are  practically  identical.  The  matter 
remains  unresolved  at  this  point  in  the  analysis. 

The  kindergarten  sample  also  exhibits  a noticeable  departure  from  time-error 
independence.  Average  time  is  inversely  correlated  with  average  error  (this  was 
observed  in  Table  2),  as  well  as  with  several  error  contrasts.  The  Easy  vs. 
Difficult  contrast  for  time  measures  is  also  correlated  with  the  error  rate.  In  the 
younger  children,  movement  accuracy  is  more  or  less  controlled  depending  c.n 
instructional  set  and  difficulty.  In  the  older  children,  the  two  systems  are  totah/ 
independent.  This  statement  is  more  precise  and  informative  than  the  conclusion 
from  Table  2 that  time  and  error  scores  were  inversely  correlated. 

Analysis  of  the  Line-Drawing  Study:  An  Overview 

Several  features  of  the  data  are  brought  into  focus  by  the  variance-covariance 
analysis  of  specific  linear  contrasts  that  would  be  obscured  in  more  conventional 
analyses.  Let  us  review  briefly  the  main  implications  of  this  analysis: 

1 . Process  independence,  based  on  the  relations  predicted  in  Fig.  8 for  average 
factor  effects,  must  be  rejected.  Factors  linked  to  one  process  affect 
measures  linked  to  other  processes  directly  and  through  interaction. 

2.  Process  independence,  looking  at  subject-source  interactions,  is  irrelevant 
given  the  preceding  result.  But  subject-source  inter*  itions  are  large  enough 
in  at  least  one  instance  to  suggest  that  independence  can  also  be  rejected  by 
this  test. 

3.  Intraprocess  parameter  independence  is  not  testable  in  this  design. 

4.  Interprocess  parameter  independence  is  supported  by  the  low  correlations 
between  time  and  error  contrasts.  This  suggests  that  time  and  errors  tap 
separate  processes  which  the  design  factors  may  be  affecting  in  confounded 
fashion.  Speed  and  accuracy  are  influenced  in  varying  degrees  from  one 
student  to  another  by  variation  in  situational  factors. 

5.  General  parameter  independence  holds  for  the  first-grade  sample,  but  not 
the  kindergarten  sample. 

6.  General-specific  parameter  independence  can  be  rejected  in  almost  every 
instance.  Average  time  and  error  scores  are  highly  correlated  with  respon- 
siveness to  situational  factors. 
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IMPLICATIONS  FOR 
EDUCATIONAL  TEST  PROCEDURES 


Current  test  construction  proceeds  as  if  we  place  buckets  under  psychological 
processes  and  collect  the  output  more  or  less  directly  from  -ndividual  subtest 
measures  (usually  total  correct  responses).  Control  over  variation  in  the  input  is 
modest  at  best,  and  nonexistent  in  most  instances.  This  simple  model  has  been 
extended  by  such  methods  as  factor  analysis,  but  it  seems  to  have  some  inherent 
weaknesses.  It  does  not  provide  a natural  way  for  introducing  process-onented 
variables  and  contextual  variables  into  the  testing  situation  in  an  easy-to-measure 

fashion.  . . , 

Factorial  test  designs  seem  to  provide  a simple  but  informative  way  to  build  a 

test  around  a process  model.  This  approach  is  similar  in  spirit  to  the  notion  of 
facet  tests  discussed  by  Guttman  (1965,  Guttman  & Schlesinger,  1967).  Careful 
analysis  of  a task  may  turn  up  many  factors  of  potential  importance,  but 
fractional  factorial  designs  allow  optimal  arrangement  of  a factorial  test  struc- 
ture so  that  a maximum  amount  of  relevant  information  is  obtained  for  a given 
number  of  test  items  (Kirk,  1968,  Chapters  9,  10).  The  experimental  control 
obtained  in  such  designs  provides  great  sensitivity  with  a reasonable  constraint 

on  test  length.  . . . 

Linear  contrasts  have  come  into  common  use  in  the  experimental  psycho- 
logical literature,  especially  in  the  analysis  of  repeated-measures  designs^  Their 
use  in  test  analysis  as  an  alternative  to  subtest  or  factor  scores  holds  considerable 
promise.  To  be  sure,  there  are  unsolved  problems  connected  with  item  analysis 
and  test  reliability. 


ADDENDUM 

This  volume  has  directed  its  attention  to  educational  matters,  and  to  the  role  of 
cognitive  psychology  in  providing  a better  understanding  of  instructional  pro- 
cesses. The  preceding  remarks  on  test  procedures  are  directed  toward  educators. 
But  for  those  readers  whose  interests  are  more  directly  related  to  cognitive 
psychology,  l should  point  out  again  that  the  analysis  of  contrast  scores  has 
direct  implications  for  tests  of  information-processing  models.  Since  Sternberg  s 
(1969)  landmark  paper  on  the  use  of  factorial  designs  in  evaluation  of  indepen- 
dent cognitive  process  models,  repeated-measures  designs  have  played  a central 
role  in  research  on  cognition.  The  analysis  of  the  variance-covanance  structure 
of  a set  of  contrasts  described  here  is  not  covered  by  the  standard  methods  of 
analysis  now  in  use.  These  are  new  techniques.  They  ask  new  questions  of  data, 
questions  which  are  critical  to  an  understanding  of  individual  differences  in 

thought  and  action. 
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ms  chapter  takes  as  its  genera!  theme  the  actual  and  potential  role  of  task 
analysis,  particularly  information-processing  analysis,  in  instructional  theory  and 
instructional  design.  Some  definitions  are  needed  to  make  this  opening  statement 
sensible.  The  term  “instruction”  is  used  here  in  its  most  general  sense  to  refer  to 
any  set  of  environmental  conditions  that  are  deliberately  arranged  to  foster 
increases  in  competence.  Instruction  thus  includes  demonstrating,  telling,  an 
explaining,  but  it  equally  includes  physical  arrangements,  structure  of  presented 
material,  sequences  of  task  demands,  and  responses  to  the  learner  s actions.  A 
theory  of  instruction,  therefore,  must  concern  itself  with  the  relationship  be- 
tween any  modifications  in  the  learning  environment  and  resultant  changes  in 
competence.  When  the  competence  with  which  we  are  concerned  is  intellectual, 
development  of  a theory  of  instruction  requires  a means  of  describing  states  ot 
intellectual  competence,  and  ultimately  of  relating  changes  in  these  states  to 

manipulations  of  the  learning  environment.  , 

In  developing  a theory  of  instruction  for  intellectual  or  cognitive  domains,  task 
analysis  plays  a central  role.  I mean  by  task  analysis  the  study  of  complex 
performances  so  as  to  reveal  the  psychological  processes  involved.  These  analyses 
translate  “subject-matter”  descriptions  into  psychological  descriptions  of  behav- 
ior They  provide  psychologically  rich  descriptions  of  intellectual  competence 
and  are  thus  a critical  step  in  bringing  the  constructs  of  psychology  to  bear  on 

instructional  design.  T . an_ivtp. 

Psychological  analysis  of  complex  tasks  is  not  a totally  new  idea.  Task  analyses 

arj  performed,  although  not  usually  under  that  name,  in  virtually  all  psychologi- 
cal investigations  of  cognitive  activity.  Whenever  performances  are  analyzed  into 
components-for  experimental,  interpretive,  or  theoretical  purposes-task _ anal- 
ysis of  some  kind  is  involved.  Although  the  study  of  complex  cognitive  tasks  ha. 
never  dominated  empirical  psychology,  there  have  been  significant  occasions  on 


Preceding  page  Mank 


51 


52  LAUREN  B.  RESNICK 


whuch  psychotogms  have  turned  their  attention  to  such  tasks.  Not  all  have  been 
instructional  m intent,  but  several  important  attempts  bear  examination  because 
they  have  substantially  influenced  instructional  theory  or  practice  or  because 
considered  with  instructional  questions  in  mind,  they  oL  bright  bto Te’ 
possible  nature  of  a theory  of  instruction  based  on  cognitive  psycholfgy 
JET  *na!y“  “ Pervasive  411  Psychological  research,  it  is  important  to 

Several'ciifen'1  u ^ ^ pirticularly  in  instructional  design. 
JJjS  * ^ u6  USCd  t0  *Valuate  the  P°tential  contribution  to  instruction 

^eO  n n fP.PrOSChCS  t0  the  Pathological  analysis  of  tasks.  Four  such  criteria 
seem  particularly  important: 

^ /nstrucoW  relevance  Are  the  tasks  analyzed  ones  we  want  to  teach’  TTiat 

rathe"  thlr't  T bCCaUSe  °f  thCir  instrUctional  or  f^eral  social  relevance, 
rather  than  because  they  are  easy  to  study,  have  a history  of  past  research  that 

th^O^  CaSy  tQ  i?rPret'  °r  ^ eSpedaUy  SUited  t0  elucidating  a pobi^of 
theoty.  The  criterion  of  instructional  relevance  implies  that  most  tasks  analyzed 

mil  be  complex  relative  to  many  of  the  laboratory  tasks  that  experimental 
psychologists  find  useful  when  pursuing  noninstructional  questions. 

te™hOlr°0iC'1  formulati°n  0065  the  analysis  yield  descriptions  of  the  task  in 
rZunirv-iTT5  °r,  ° UnitS  reC08niZCd  by  **  Psychological  research 

generally  resisted  m^yS1S  “a  means  of  complex  tasks,  which  have 

Tf,  g°  'Xpenmental  “^sis.  into  contact  with  the  concepts, 
Si  r°neS  0f  pSychol°^-  ^us,  while  the  starting  point  for  instruc- 

teach_r  ? ^ “o  Prescribed  by  soc*al  decisions-what  is  important  to 
teach  the  outcomes  of  such  analysis,  the  terms  used  in  breaking  apart  complex 

ShXT*’  mUSt  detem,ined  by  thC  State  of  theory  and  knowledge  in 

osvohn."01  fT*  .MSy  t0  fUlfiU  b°th  thC  instructi°nal  relevance  and  the 
£ycholog,cal  formulation  criteria  at  once;  instructional  relevance  is  defined  in 

ferent  terms  than  those  which  psychological  researchers  use  in  building  their 

Sc'Th  fNcve.rthc,eSS>  11  ls  imPortant  to  try  to  analyze  instructional  tasks  in 

Sol  V°ntaCt  With  the  CUrrent  body  of  knowledge  and  constructs  in 

Aevexi  ^ h°  m.StrUf  0nal  practice  can  Profit  from  scientific  findings  as 
uiey  exisi  and  as  they  develop. 

Jm^uctaMity  Bec^  our  concern  here  is  with  task  analysis  as  an  aid  to 

^ T’ '•  “ °b"’OUS|qUeStion  is  whcther  to*  results  of  a particular  analysis  are 

uwafle  in  instructional  practice.  In  other  words,  does  the  task  analysis  reveal 

able’’" It  0fttbe,taSk  that  ,*nd  themse,ves  t0  instruction,  i.e.,  that  are  “instruct- 
v . ,S  *C  functlon  of  task  analysis  to  examine  complex  performances  and 
daplay  in  them  a substructure  that  is  teachable -either  through  direct  instruc- 

!*°  J"  hC  COmponents>  or  by  Practice  in  tasks  that  call  upon  the  same  or  related 

1/IUwCSSCS  • 
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Recognition  of  stages  of  competence  Does  the  task  analysis  recognize  a 
distinction  between  early  forms  of  competence  and  later  ones?  Analyses  for 
instructional  purposes  cannot  just  describe  the  expert’s  performance  (although 
such  description  will  almost  always  be  a part  of  such  analyses).  They  must  also 
describe  performance  characteristics  of  r.ovices  and  attempt  to  discover  or  point 
to  key  differences  between  novices  and  experts,  suggesting  thereby  ways  of  arrang- 
ing experiences  that  will  help  novices  become  experts.  Instructional  task  analysis, 
in  other  words,  should  elucidate  the  relations  between  activity  during  learning  and 
competence  that  results  from  learning.  It  should  suggest  ways  of  organizing 
knowledge  to  assist  in  acquisition,  recognizing  that  this  organization  may  differ 
from  organizations  that  are  most  efficient  for  expert  use  of  that  knowledge. 

In  summary,  four  criteria  can  be  applied  in  assessing  the  contributions  of 
psychological  task  analyses  to  instruction  (1)  instructional  relevance;  (2)  psy- 
chological formulation;  (3)  instructability;  and  (4)  recognition  of  stages  of 
competence.  In  the  course  of  this  chapter,  I shall  examine  several  prominent 
approaches  to  the  psychological  analysis  of  complex  tasks  and  consider  their 
contributions  to  instruction  in  light  of  these  criteria.  I begin  with  some  impor- 
tant past  efforts  to  describe  intellectual  competence  in  psychological  terms,  and 
then  turn  to  current  information-processing  approaches  to  task  analysis.  In  order 
to  make  the  domain  of  the  chapter  manageable,  discussion  is  limited  to  analysis 
of  mathematics  tasks.  The  work  discussed,  however,  is  not  intended  to  be 
exhaustive  of  task  analysis  efforts  in  mathematics.  Rather,  it  is  intended  to 
highlight  certain  cases  that  have  considerably  influenced  psychology  or  instruc- 
tion, or  both,  and  that  form  landmarks  in  whatever  might  today  be  written  of  a 
history  and  current  status  report  on  this  branch  of  instructional  psychology. 


A SELECTIVE  HISTORY  Or  TASK  ANALYSIS 

I will  discuss  first  the  work  of  three  predecessors  of  modem  information 
processing  task  analysis,  in  each  case  using  work  on  mathematics  as  the  substan- 
tive example.  These  are:  (a)  work  in  the  associationist/behaviorist  tradition 
(Thorndike,  Gagne);  (b)  work  of  the  Gestalt  school  (especially  Max  Wertheimer); 
and  (c)  the  Piagetian  task  analyses.  Both  substantively  and  methodologically,  the 
approaches  of  these  groups  to  task  analysis  reflect  differences  in  their  theoretical 
positions,  differences  which  in  turn  affect  the  kinds  of  contributions  that  each 
can  make  to  instruct  >n. 

The  Associationist/Behaviorist  Tradition 

Thorndike's  analyses  in  terms  ofS-R  bonds  In  the  early  part  of  this  century, 
experimental  and  educational  psychology  were  closely  allied.  Many  of  the  major 
psychologists  of  the  period  up  to  about  1930  were  actively  engaged  in  both 
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laboratory  research  and  applied  research,  some  of  it  relevant  to  instructional 
practice.  One  of  the  foremost  of  these  was  Edward  L.  Thorndike.  His  work  on 
The  Psychology  of  Arithmetic,  published  in  1922,  represents  his  attempt  to 
translate  the  associationist  theory  of  “laws  of  effect,”  which  he  himself  was 
active  in  developing,  into  a set  of  prescriptions  for  teaching  arithmetic.  In  the 
preface  to  the  book,  Thorndike  states  (1922)  that  there  is  now  a ‘hew  point  of 
view  concerning  the  general  process  of  learning.  We  now  understand  that 
learning  is  essentially  the  formation  of  connections  or  bonds  between  situations 
and  responses,  that  the  satisfyingness  of  the  result  is  the  force  that  forms  them, 
and  that  habit  rules  in  the  realm  of  thought  as  truly  and  as  fully  as  in  the  realm 
of  action  (p.  v] .”  Based  on  this  then  widely  agreed  upon  theory  of  psychological 
functioning,  Thorndike  proposed  a pedagogy  that  has  extensively  influenced 
educational  practice  for  many  years. 

Thorndike  proposed  the  analysis  of  arithmetic  tasks  in  terms  of  specific 
connections,  or  bonds,  between  sets  of  stimuli  and  responses,  and  the  organiza- 
tion of  instruction  to  maximize  learning  of  both  the  individual  bonds  and  the 
relations  among  them.  His  book  began  with  a discussion  of  the  general  domains 
of  arithmetic  for  which  bonds  must  be  formed— for  example,  the  meanings  of 
numbers,  the  nature  of  decimal  notation,  the  ability  to  add,  subtract,  multiply, 
and  divide,  the  ability  to  apply  various  concepts  and  operations  in  solving 
problems.  Thorndike  then  spent  some  fifty  pages  discussing  the  types  of  bonds 
that  give  precise  meaning  to  this  broad  definition  of  the  domain  of  arithmetic. 
His  analysis  did  not  approach  the  level  of  individual  stimulus-response  pairs  but 
remained  on  the  more  general  level  of  connections  between  situations  and  sets  of 
responses.  Citing  numerous  examples,  he  argued  that  certain  kinds  of  bonds 
taught  in  many  of  the  standard  textbooks  of  the  day  were  misleading  and  should 
not  be  taught,  while  other  helpful  bonds  were  neglected  in  pedagogical  practice. 
For  example,  verifying  results  of  computations,  learning  addition  and  substrac- 
tion  facts  for  fractions,  and  solving  problems  in  equation  form  (even  before 
algebra  was  added  to  the  curriculum)  were  considered  “desirable”  bonds,  where- 
as senseless  drill  in  finding  the  lowest  common  denominator  of  fractions  (when 
use  of  any  common  denominator  would  lead  to  solution  of  problems)  and  the 
posing  of  problems  unrelated  to  real-life  situations  led  to  the  formation  of 
“wasteful  and  harmful”  bonds  that  made  arithmetic  confusing  and  unpleasant. 
Discussion  of  appropriate  and  inappropriate  forms  of  measurement  of  the  bonds 
or  elements  of  arithmetic  knowledge  were  also  included.  Thus,  the  total  effect  of 
the  book  was  to  suggest  the  translation  of  a standard  school  subject  into 
terms-collections  of  bonds-that  suggested  applications  of  known  laws  of  learn- 
ing to  the  problems  of  instruction. 

The  laws  of  learning,  and  thus  of  pedagogy,  were  for  Thorndike  those  dealing 
with  such  drill  and  practice  as  would  strengthen  the  bonds.  Questions  such  as 
amount  of  practice,  under-  and  overlearning,  and  distribution  of  practice  were 
considered.  These  are  easily  recognized  as  topics  that  have  continued  to  occupy 
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psychologists-although  rarely  directly  in  the  context  of  school  instruction-and 
that  heavily  though  indirectly  influence  instructional  practice.  What  is  important 
about  Thorndike’s  work,  however,  is  that  he  developed  a concern  not  only  with 
the  laws  of  learning  in  general,  but  also  with  the  laws  of  learning  as  applied  to  r. 
particular  discipline,  arithmetic.  He  left  the  laboratory  to  engage  in  applied 
research,  but  brought  with  him  the  theory,  and  to  a large  extent  the  methodol- 
ogy, of  the  experimental  laboratory.  He  thus  began  a tradition  of  experimental 
work  in  instruction  by  psychologists.  This  tradition  was  interrupted  for  many 
yean  but  is  now  being  revived,  as  the  chapten  in  this  volume  bear  witness. 

Gagnt's  hierarchies  of  learning  sets  While  Thorndike  recognized  the  need  for 
a theory  of  sequencing  in  his  presentation  of  bonds  identified  as  constituting  the 
subject  matter  of  arithmetic,  he  had  no  systematic  theory  of  sequencing  to 
pr  opose.  In  the  decades  following  Thorndike’s  work,  mathematics  educaton  and 
educational  psychologists  (e.g.,  Brownell  & Stretch,  1931 ; Hydle  & Clapp,  1927) 
studied,  with  varying  degrees  of  care  and  precision,  the  relative  difficulty  of 
different  kinds  of  mathematical  problems.  They  thus  empirically,  if  not  theoret- 
ically, extended  Thorndike’s  work  in  instructional  analysis.  The  suggestion 
underlying  this  later  work  was  that  arranging  tasks  according  to  their  order  of 
difficulty  would  optimize  learning,  especially  of  the  more  difficult  tasks.  Skin- 
ner’s (1953)  prescription  for  the  use  of  “successive  approximations”  in  instruc- 
tion represented  a refinement  of  this  basic  idea.  However,  neither  Skinner  nor 
his  immediate  interpreters  proposed  a systematic  strategy  for  generating  the 
order  of  successive  approximations-ie.,  the  sequence  of  tasks  in  instruction.  It 
was  not  until  the  1960s,  and  Gagne’s  work  on  hierarchies  of  learning  (Gagne. 
1962,  1968),  that  any  organized  theory  of  sequencing  for  instructional  purposes 
appeared  wilhin  the  behaviorist  tradition. 

Learning  hierarchies  are  nested  sets  of  tasks  in  which  positive  transfer  from 
simpler  to  more  complex  tasks  is  expected.  The  “simpler”  tasks  in  a hierarchy 
are  not  just  easier  to  learn  than  the  more  complex;  they  are  included  in- 
components of- ‘he  more  complex  ones.  Acquisition  of  a complex  capability, 
then,  is  a matter  of  cumulation  of  capabilities  through  successive  levels  of 
complexity  Transfer  occurs  because  of  the  inclusion  of  simpler  tasks  in  the 
more  complex.  Thus,  learning  hierarchies  embody  a special  version  of  a “com- 
mon elements”  theory  of  transfer. 

Hierarchy  analysis  has  come  into  rather  widespread  use  among  instructional 
designers,  particularly  in  the  fields  of  mathematics  and  science  (see  White, 
1973).  For  the  most  part,  the  analyses  have  been  of  the  kind  Gagni  originally 
described.  Thus,  hierarchies  for  instruction  are  typically  generated  by  answering, 
for  any  particular  task  under  consideration,  the  question:  "What  kind  of  capa- 
bility would  an  individual  have  to  possess  to  be  able  to  perform  this  task 
successfully  were  we  to  give  him  or  her  only  instructions?”  One  or  more 
subordinate  asks  are  specified  in  response  to  this  question,  and  the  question  is 
applied  in  turn  to  the  subordinate  tasks  themselves. 
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IG  ! A leuiimg  hierarchy  pertaining  to  the  addition  of  integers  (From  “Factors  in 
ccquinng  knowledge  of  a mathematical  task”  by  R.  M Gagni,  J.  R.  Mayor.  H.  L.  Garstens. 
ARE  Paradise.  ftycMogictl  Monograph:,  1962,  76  ( V/hole  No.  526).  Copyright  1 962  by 
the  A men  cm  Psychological  Association.  Reprinted  by  permission.) 
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Figure  1 shows  an  example  of  one  of  Gagni’s  hierarchies.  The  tasks  described 
in  the  top- level  box  are  the  targets  for  instruction.  Lower  levels  show  successive 
layers  of  subordinate  capabilities,  that  is,  simpler  tasks  whose  mastery  would 
facilitate  learning  the  more  complex  ones.  Instruction  would  begin  with  the 
lowest-level  capabilities  not  already  mastered  and  proceed  upward.  The  tasks  at 
the  low  end  of  the  hierarchy  can  be  analyzed  further,  depending  on  assumptions 
about  the  learner’s  knowledge.  It  is  assumed  that  the  more  elementary  capabil- 
ities are  learned  through  more  elementary  types  of  learning.  In  other  words, 
implicit  in  a complete  learning  hierarchy  for  a task  such  as  the  one  shown  ir  Fig. 
1 is  another  hierarchy  of  “types  of  learning,”  progressing  from  simpk  S-R 
learning,  through  chaining  and  discrimination,  to  higher-level  concept  5tnd  rule 
learning,  as  shown  in  Fig.  2.  A more  complex  task  such  as  problem  solvng  would 
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FIG  2 Gagn6s  hierarchy  of  type*  of  learning.  (From  R M.  Gagne.  The  Conditions  of 
learning  (2nd  ed  >,  New  York:  lloil.  Rinehart  A Winston.  1970.  Copyright  1965  by  Holt. 
Rinehart  A Winston  Repnnted  by  permission  I 
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involve  more  concept  and  rule  lean.ing  and  would  lead  lo  the  discovery  of 
progressively  higher -order  generalizable  rules. 

Gagne’s  hierarchy  analyses  appear  to  fi  rt  with  information-processing  concep- 
tions of  psychology,  but  not  to  come  to  grips  with  them.  There  is  a kind  of 
implicit  process  analysis  involved  in  the  method  of  hierarchy  generation  Presurri- 
ably.  in  order  to  answer  the  question  that  generates  subordinate  tasks,  one  must 
have  in  mind  some  idea  of  what  kinds  of  operations- mental  or  otherwise  an 
individual  engages  in  when  he  or  she  performs  the  complex  task.  However , this 
model  of  performance  is  left  entirely  implicit  in  Gagne's  work 

Gestalt  Psychology  and  the  Analysis  of  Mathematical  Tasks 

Gestalt  psychology  was  an  immigrant  1 1 America  In  its  irst  generation  it  spoke 
s language  so  unlike  the  rest  of  Amencan  psychology  that  it  was  barely  listened 
to  Now.  in  a period  when  we  speak  easily  of  cognition  and  mental  operations, 
the  gestalt  formulaUons  take  on  more  interest  for  us.  Gestalt  theory  was 
fundamentally  concerned  with  perception  particularly  the  apprehension  of 
’’structure  ’’  With  respect  to  the  complex  pfocrss-%  involved  in  thinking,  the 
concept  of  structure  led  to  a concern  with  ’un«k ^standing  or  insight,  often 
accompanied  by  a visual  representation  of  some  kind  With  respect  to  problem 
solving,  the  central  conrem  was  with  the  dynamics  of ’’productive  ihtnktng  " 
Several  pstalt  psychologists,  particularly  Wertheimer  <1959*  and  his  students 
(Katona.  1940.  Luchins  & Luchins.  |9T0).  attempted  to  apply  the  basic  pnnet 
pies  of  gestalt  interpretation  to  problems  of  utstruction  and.  in  particular,  to  the 
teaching  of  mathematics  It  is  reasonable  to  imagine  that  mathematics,  especially 
^ometry.  was  of  pirticu-ar  interest  to  gestalt  theorists  because  of  its  high  degree 
of  internal  structure  and  its  susceptibility  to  visual  representation 

Wertheimer  contrasted  his  theory  of  productive  thinking  both  with  traditional 
logic  and  with  associations  descriptions  of  problem  striving  Neither  of  these,  he 
claimed,  gives  a complete  picture  of  how  new  knowledge  is  produced  by  the 
individual  With  respect  to  teaching,  he  was  concerned  that  prevalent  methi*ds  of 
teaching,  with  emphasis  on  practice  and  recall,  produced  senseless  combina 
tions”  rather  than  productive  problem  solving  based  on  the  structure  of  the 
problem 

Werthemwr  s (|959)  book.  Pt,>ducmr  Th’jikmg.  originally  published  tn  !**■*< 
discusses  work  on  several  mathematics  problems  for  example  finding  (he  area 
of  a parallelogram,  proving  the  equality  of  angles.  Gauss  s formula  for  the  sum  of 
j senes,  symmetry  of  oscillations,  anthmetic  calculations,  and  the  sum  ot  angles 
of  a figure  Analysis  of  these  tasks,  for  Wertheimer  consnsted  of  displaying  the 
problem  structure  on  which  algorithms  are  based,  rather  than  analyzing  actu. . 
performance  Thus,  for  example,  the  problem  of  finding  the  area  of  a pa.alklo- 
gram  was  seen  as  a ^loblem  cf  ’’gap  fitting  too  much  on  one  side  too  little  on 
the  other  (see  Fig  3)  Once  the  gap  is  filled  and  a rectangle  formed  a general 
principle  for  finding  area  can  be  appised  It  is  recognition  of  the  nature  of  the 
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FIG.  3 Wertheimer’s  ties  of  a parallelogram  problem. 

problem-the  possibility  of  transforming  the  parallelogram  into  a rectangle -that 
constitutes  for  We.theimer  “understanding”  or  “insight.”  Solutions  that  follow 
from  this  understanding  are  for  him  true  solutions,  elegant  ones.  Those  that 
“blindly”  apply  an  algorithm,  even  if  the  algorithm  should  work,  are  “ugly” 
(Greeno.  Chapter  7 of  this  volume,  discusses  another  example  from  Wertheimer). 

Though  Wertheimer  talked  little  about  general  schemes  for  instruction,  his 
notions  imply  the  necessity  of  analyzing  tasks  into  components,  perceptual  and 
structural,  such  that  their  nature  in  relation  to  the  whole  problem  is  clear.  Only 
when  the  true  structures  of  problems  are  understood  can  principles  derived  from 
them  be  properly  generalized.  Whenever  possible,  it  should  be  left  to  the  student 
to  discover  both  the  problem  and  its  solution.  Instruction,  if  it  should  be 
necessary,  should  proceed  in  a way  consistent  with  the  internal  structure  of  the 
problem,  and  in  the  proper  sequence,  so  that  a true  understanding  is  gained  by 
the  child,  leading  to  solution  Just  how  the  undemanding  of  components  and 
their  part-whole  relationships  is  io  be  taught  is  not  made  clear  Wertheimer 
suggested  that  exercises  could  be  introduced  which  focus  students’  attention  on 
certain  aspects  of  the  problem  structure,  which  should  increase  the  likelihood  of 
achieving  insight.  He  also  spoke  of  certain  operations  involved  in  thinking 
processes -grouping,  reorganizing,  structurization-  from  which  one  might  devise 
ways  of  teaching 

Piagetian  Analyses 

In  discussing  Piagetian  task  analysis  we  must  consider  two  quite  distinct  bodies 
of  literature  (1)  Piaget’s  own  work  (and  that  of  others  in  Gcnev»J;  and  (2) 
attempis-largelv  by  American  and  Briiish  psychologists  lo  isol  ite  the  specific 
concepts  and  processes  underlying  performance  on  Piagetian  tasks.  1 will  discuss 
these  in  succession 

G,  even  work  Much  of  Piagel's  own  work  (on  number,  geometry,  space,  etc.) 
is  heavily  mathematical  in  orientation  It  seeks  to  characterize  cognitive  develop- 
ment in  terms  of  a succession  of  logical  structures  commanded  by  individuals 
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over  time.  The  “clinical  method”  used  by  Piaget  in  his  research  yields  great 
quantities  of  raw  process  data-protocols  of  children’s  responses  to  various  tasks 
and  questions.  The  protocols  are  interpreted  in  terms  of  the  child’s  “having”  or 
“not  having”  structures  of  different  kinds.  Explanation  of  a task  performance 
for  Piaget  consists  of  descriptions  of  the  logical  structures  that  underly  it,  and  of 
the  structures  that  ontologically  preceded  and  therefore  in  a sense  “gave  birth 
to”  the  current  ones. 

Piaget’s  tasks  are  chosen  to  exemplify  logical  structures  that  are  assumed  to  be 
universal.  Many  of  them  turn  out  to  involve  mathematics,  but  by  and  large  not 
the  mathematics  that  is  taught  in  school.  One  result  has  been  considerable 
debate  over  whether  the  Piagetian  tasks  should  become  the  basis  of  the  school 
curriculum,  whether  they  are  teachable  at  all,  and  whether  they  set  limits  on 
what  other  mathematical  content  can  be  taught  (for  differing  points  of  view  on 
this  matter,  see  Furth,  1970;  Kamii,  1972;  Kohlberg,  1968;  Rohwer,  1971). 
Although  until  recently  Piaget’s  work  has  not  been  motivated  by  instructional 
concerns,  others  have  tried  to  interpret  his  work  for  instruction.  This  has  often 
resulted  in  at  1c:  i partially  competing  interpretations. 

Piaget’s  most  important  contribution  to  t^sk  analysis  is  probably  his  pointing 
out,  in  compelling  fashion,  that  there  are  important  differences  between  children 
and  adults  in  the  way  they  approach  certain  tasks,  the  knowledge  they  bring  to 
them,  and  the  processes  they  have  available.  However,  his  analysis  in  terms  of 
logic  leaves  questionable  the  extent  to  which  his  descriptions  elucidate  the 
“psychologies”  of  behavior  on  these  tasks,  that  is,  w!  at  people  actually  do.  It  is 
oertainly  the  case  that  for  psychologists  accustomed  to  the  explicit  detail  of 
information-processing  analyses,  the  leap  from  observation  to  references  con- 
cerning logical  structure  is  often  difficult  to  follow  in  Piaget’s  work. 

Experimental  analyses  of  Piaget's  tasks.  Much  of  the  English  -language  research 
literature  on  Piaget  has  focused  on  locating  specific  concepts  or  component 
processes  underlying  the  ability  to  perform  v/ell  on  particular  tasks.  Conserva- 
tion tasks  have  been  mostly  heavily  studied,  classification  tasks  probably  next 
most  heavily.  There  has  been  relatively  little  study  of  tasks  characteristic  of  the 
stage  of  formal  rather  than  concrete  operational  thinking  (see  Glaser  & Resnick, 
1972). 

Two  basic  strategies  can  be  distinguished  in  this  research.  One  is  to  vary  the 
task  in  small  ways  to  allow  inferences  about  the  kinds  of  cognitive  processes 
being  used.  An  example  of  this  first  strategy  is  a series  of  studies  by  Smedslund 
(1%4,  1967a,  b),  in  which  he  presented  double  classification  tasks  with  attri- 
butes covered  or  uncovered,  labeled  or  visually  presented.  From  performance  on 
these  variations,  he  concluded  that  processing  was  probably  done  at  a symbolic 
ratner  than  a perceptual  level,  that  memory  was  involved,  and  that  some  kind  of 
analytic  mechanism  might  be  involved  in  committing  perceptions  or  symbols  to 
memory. 
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The  second  research  strategy  is  to  instruct  children  in  a concept  or r ;ocess 
hypothesized  to  underly  performance  on  some  Piagetian  task,  and  then  test  to 
see  whether  they  thereby  acquire  the  ability  to  perform  the  task.  Examples  of 
research  of  the  second  kind  are  Gelman's  (1969)  study  on  training  conservation 
by  teaching  discrimination  of  length,  density,  and  number;  and  Bearison’s 
(1969)  study  inducing  conservation  by  training  in  equal-unit  measurement  of 
liquid  quantity.  Of  the  two  approaches,  the  second  is  more  directly  interesting  in 
the  present  context,  because  the  strategy  of  instruction  demands  an  analysis  in 
terms  of  instructable  components. 

Assessment  of  the  Approaches  with  Reference  to  Instruction 

How  do  these  past  approaches  to  task  analysis  match  the  criteria  outlined  for 
instructional  relevance?  To  what  extent  does  each  address  itself  to  tasks  of 
instructional  interest?  To  what  extent  do  the  terms  of  analysis  provide  a link  to 
the  main  body  of  psychological  theory  and  knowledge?  Are  instructable  units 
identified?  Do  the  analyses  d^’inguish  usefully  between  performance  of  learners 
and  of  experts? 

Instructional  relevance.  With  respect  to  the  choice  of  tasks,  only  Thorndike 
and  Gagni  show  a clear  instructional  orientation.  Their  tasks  are  drawn  from 
school  curricula,  and  where  formal  validation  studies  of  their  analyses  occur, 
they  are  to  a large  extent  based  on  the  effectiveness  of  actual  instruction  in  the 
units  identified  (e.g.,  Gagr.e,  Mayor,  Garstens,  & Paradise,  1962).  Wertheimer 
and  the  others  of  the  Gestalt  school  analyze  a few  tasks  drawn  from  mathe- 
matics, but  nvJte  no  attempt  to  analyze  a whole  range  of  subject  matter. 
Further,  despite  some  discussion  of  productive  thinking  as  a generalized  phe- 
nomenon of  educational  concern,  there  is  no  analysis  of  it  as  such  in  Wert- 
heimer s work.  It  seems  likely  that  Wertheimer  chose  tasks  from  mathematics 
that  would  best  lend  themselves  to  analysis  in  terms  of  perceptual  “Gestalten” 
rather  than  selecting  those  of  particular  importance  to  instruction.  On  the 
criterion  of  types  of  tasks  analyzed,  Piaget’s  work  is  even  less  directly  relevant  to 
instruction.  There  is,  in  fact,  serious  question  whether  the  concrete  operations 
tasks  he  studied  ought  to  be  the  objects  of  instruction,  since  they  are  psychological 
“indicators”  of  general  cognitive  status  rather  than  socially  important  tasks,  and 
since  they  appear,  at  least  in  Western  and  certain  urbanized  cultures,  to  be  acquired 
without  formal  schooling  in  the  course  of  development  (Glaser  & Resnick,  1972). 

It  may  be,  however,  that  formal  operations  need  to  be  taught  explicitly, since  it  is 
by  no  means  clear  that  formal  operational  thinking  is  universally  acquired 
(Neimark,  1975). 

Psychological  crmulation.  Each  approach  addresses  well  the  analysis  of  com- 
plex tasks  in  terms  of  the  fundamental  psychological  constructs  relevant  to  their 
own  times  and  theories.  Thus,  Thorndike’s  analyses  describe  arithmetic  in 
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terms  of  the  basic  psychological  unit  of  then-current  theory,  the  associationist 
bond,  and  thus  suggest  specific  pedagogical  practices  drawn  from  known  princi- 
ples of  learning.  Gagne’s  analyses  interpret  instructional  tasks  in  the  terms  of 
behavioral  learning  psychology:  transfer,  generalization,  and  so  forth.  His  con- 
cern for  the  learning  of  “higher  processes”  such  as  rules  and  principles  suggests 
some  sharing  of  concern  with  cognitive  psychology;  however,  basic  cognitive 
processes,  such  as  memory  and  perception,  are  alluded  to  only  as  general  abilities 
assumed  not  to  be  instructable  or  further  analyzable.  Wertheimer’s  analyses  of 
mathematical  tasks  explicitly  indicate  how  gestalt  field  theory  would  inteipret 
problem  solving  and  learning  in  these  domains  Finally,  Piaget’s  analyses, 
like  Wertheimer’s,  attempt  to  show  that  performance  on  complex  tasks  can  be 
interpreted  in  terms  of  underlying  structures.  For  Piaget  and  Wertheimer,  expli- 
cation of  the  structures  constitutes  psychological  explanation  of  the  perfor- 
mance. Both  are  concerned  wPh  characterizing  the  broad  outlines  of  cognitive 
structures  rather  than  with  detailing  the  processes  involved  in  building  or 
utilizing  these  structures.  Only  in  the  experimental  analyses  of  Piagetian  tasks  do 
we  begin  to  find  attempts  to  interpret  peiformance  more  explicitly,  that  is,  in 
information-processing  tern ,. 

Instructability.  With  respe  t to  the  criterion  of  instructa’iility,  Thorndike 
and  Gagn i are  directly  on  target.  Their  aim  in  task  analysis  is  to  facilitate 
instruction,  and  me  honds  ,r  subordinate  capabilities  identified  are  quite  clearly 
described  as  instructable  components.  Wertheimer  is  more  difficult  to  assess  with 
respect  to  this  criterion.  His  analyses  are  specific  to  particular  tasks.  They  do 
display  the  basic  structure  of  each  task  and  therefore  suggest  quite  directly  ways 
of  teaching  that  are  likely  to  produce  maximum  understanding,  transfer,  and 
elegance  of  solution;  but  there  are  no  general  units  identified  which  would  be 
useful  across  a number  of  tasks.  Piaget’s  own  analyses  involve  no  identification 
of  instructable  units.  However,  a review  of  studies  involving  instruction  in 
Piagetian  tasks  (Glaser  & Resnick,  1972)  suggests  that  Piagetian  concepts  are 
indeed  instructable,  or  at  least  lend  themselves  to  analysis  into  certain  prerequi- 
site skills  which  may  be  instructable.  The  studies  also  suggest  how  delicate  the 
process  of  task  analysis  and  instruction  is  for  tasks  of  any  psychological  com- 
plexity. It  is  necessary  both  to  identify  the  appropriate  underlying  processes  or 
concepts  and  to  find  effective  ways  of  teaching  them.  Identifying  one  underlying 
concept  will  rarely  suffice  for  full  success  in  instructional  efforts  because  there 
may  be  several  abilities  which  must  be  combined,  and  the  absence  of  any  one 
may  lead  to  failure  to  learn  the  target  task.  Further,  “instruction”  itself  is  a very 
delicate  matter.  There  are  no  simple  rules  for  constructing  situations  that  will 
convey  the  concepts  or  processes  to  be  taught  in  a clear  way.  Even  with  an 
appropriate  task  analysis,  the  mapping  from  identified  components  to  instruc- 
tional strategies  remains  very  much  a matter  of  artful  development. 

Recognition  of  stages  of  competence.  Finally,  we  turn  to  the  novice-expert 
distinction,  the  criterion  of  recognition  of  stages  of  competence.  On  this  matter 
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Thorndike  is  not  very  explicit.  He  recognizes  a need  for  sequencing  instruction 
scientifically,  but  offers  no  psychological  theory  as  to  how  to  proceed.  Indeed, 
the  impression  left  is  that  the  difference  between  novices  and  experts  lies  solely 
in  how  many  bonds  have  been  learned  and  how  well-practiced  these  are.  That 
there  may  be  important  differences  in  the  organization  of  knowledge  for  novices 
and  experts  is  at  best  only  hinted,  and  not  seriously  explored.  Gagni  s particular 
contribution  within  the  behavioral  perspective  is  a practical  method  for  generat- 
ing sequences  of  instructable  tasks.  In  his  general  notion  of  transfer— inclusion  of 
simple  task',  in  more  complex  ones— Gagni  offers  a strong  suggestion  for  how  to 
organize  instruction  for  purposes  of  acquiring  higher-order  knowledge  and  skills. 
Thus,  at  a certain  level,  the  criterion  of  recognizing  and  dealing  with  differences 
between  novices  and  experts  is  explicitly  met  in  learning-hierarchy  analyses. 
Wertheimer’s  analyses,  by  contrast,  attend  not  at  all  to  the  distinction  between 
novices  and  experts.  The  implicit  assumption  is  that  behavior  in  accord  with 
good  structural  principles  is  “native”  and  has  simply  been  stamped  out  or 
squelched  by  th?  drill  orientation  of  schools. 

Piaget,  of  course,  is  particularly  attuned  to  changes  in  the  structures  available 
to  people  at  different  stages  in  their  intellectual  development.  In  fact,  with 
respect  to  instruction,  Piaget’s  largest  contribution  is  very  possibly  the  highlight- 
ing of  substantive  changes  in  competence  which  occur  in  the  course  of  develop- 
ment. Piaget’s  work  makes  it  impossible  to  ..more  differences  between  perfor- 
mance strategies  of  novices  and  experts -whe«  her  or  not  we  find  Piaget’s  own 
analyses  convincing  or  accept  his  explanations  of  how  these  changes  occur.  By 
contrast,  the  experimental  or  neo-Piagetian  work  is  uneven  on  this  criterion.  For 
the  most  part,  these  studies  investigate  single  tasks  and  look  for  competence 
versus  incompetence  rather  than  for  stages  or  transformations  of  competence. 
There  are  a few  exceptions,  largely  in  recent  attempts  to  interpret  changes  in 
performance  on  Piagetian  tasks  in  terms  of  information-processing  constructs 
(see  Klahr,  in  press).  Investigators  have  attempted  to  analyze  sequences  of 
Piagetian  tasks  so  that  adding  one  or  two  simple  processes  to  an  individual’s 
repertoire,  or  modifying  extant  processes,  can  be  shown  to  account  for  succes- 
sively more  complex  performances  on  the  Piagetian  tasks.  This  work  takes  infor- 
mation processing"  as  its  theoretical  orientation  and  makes  heavy  use  of  computer 
simulation  strategies  for  formal  analyses.  It  thus  forms  a useful  bridge  to  the  second 
part  of  this  chapter,  which  is  concerned  specifically  with  the  oresent  and  potential 
role  of  information-processing  task  analysis  in  instructional  design. 


INFORMATION-PROCESSING  ANALYSES 
FOR  INSTRUCTIONAL  PURPOSES 

A major  branch  of  cognitive  psychology  today  carries  the  label  “information 
processing."  As  is  often  the  case  with  an  emerging  branch  cf  study,  it  is  -asier  to 
point  to  examples  of  information-processing  research  than  to  give  a complete  or 
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consensual  definition  of  it.  Nevertheless,  psychologists  working  in  this  area  tend 
to  share  certain  assumptions  as  well  as  certain  research  strategies. 

Information-processing  studies  attempt  to  account  for  performance  on  cogni- 
tive tasks  in  terms  of  actions  (internal  or  extenal)  that  take  place  in  a temporal- 
ly ordered  flow.  A distinction  is  generally  drawn  between  data,  or  information, 
and  operations  on  data,  or  processes.  Thus,  the  concern  of  information-process- 
ing psychology  is  with  how  humans  act  upon  (process)  data  (information). 
Frequently  -but  not  universally-information-processing  models  for  cognitive 
tasks  are  expressed  as  “programs"  for  performance  of  particular  tasks  These  are 
often  formalized  as  computer  programs  whose  theoretical  validity  is  judged  by 
their  ability  to  simulate  actual  human  performance. 

Most  information-processing  theories  and  models  find  it  useful  to  characterize 
the  human  mind  in  terms  of  the  way  information  is  stored,  accessed,  and 
operated  upon.  Distinctions  are  made  among  different  kinds  or  “levels”  of 
memory.  While  the  details  and  ihe  labels  vary,  most  theories  distinguish  between 
a sensory  intake  register  of  some  kind  through  which  information  from  the 
environment  enters  the  system,  a working  memory  (sometimes  called  short-term 
or  intermediate-term  memory)  in  which  the  actual  processing  work  goes  on,  and 
a long-term  (semantic)  memory  in  which  everything  one  knows  is  stored, 
probably  permanently.  Within  this  general  structure,  working  memory  is  pivotal. 
It  is  only  by  being  processed  in  working  memory  that  material  from  the  external 
environment  can  enter  the  individual's  long-term  store  of  knowledge,  and  only 
by  entering  working  memory  can  information  from  the  long-term  store  be 
accessed  and  used  in  the  course  of  thinking.  Processing  in  working  memory  is 
usually  assumed  to  be  serial-one  action  at  a time.  Further,  working  memory  is 
considered  to  have  a limited  number  of  “slots”  that  can  be  filled,  30  that  it  is 
only  by  rehearsing  or  by  “chunking”  material  into  larger  units  (so  that  a body  of 
interrelated  information  takes  up  a single  slot)  that  loss  of  information  from 
working  memory  can  be  avoided. 

Information-processing  analyses  of  instructional  tasks  share  these  general 
assumptions  as  well  as  a body  of  research  methods  that  have  been  developed  for 
testing  the  validity  of  models  of  cognitive  performance.  Information-processing 
analyses  are  clearly  distinguished  from  behaviorist  ones  (Thorndike  and  Gagne  in 
the  present  case)  by  their  explicit  attempts  to  describe  internal  processing.  They 
differ  from  the  cognitivist  Gestalt  and  Piagetian  positions  in  their  attempts  to 
describe  the  actual  flow  of  performance-to  translate  “restructuring"  or  “logical 
operations”  into  temporally  organized  sequences  of  actions. 

In  characterizing  information-processing  analyses  of  complex  tasks,  it  is  useful 
tc  distinguish  between  rational  and  empirical  analyses.  Rational  analyses  are 
descriptions  of  “idealized"  performances-that  is,  performances  that  succeed  in 
responding  to  task  demands,  often  in  highly  efficient  ways,  but  not  necessanly 
the  ways  in  which  humans  actually  perform  the  tasks.  Work  in  artificial  intelli- 
gence can  be  considered  a form  of  rational  task  analysis  which  is  today  being 
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applied  to  increasingly  complex  kinds  of  tasks.  So  can  some  much  less  ambitious 
analyses  of  simple  tasks,  some  of  which  are  discussed  below.  Empirical  task 
analyses  are  based  on  interpretation  of  the  data  (errors,  latencies,  self-reports, 
eye  or  hand  movements,  etc.)  from  human  performance  of  a task;  the  aim  of 
such  analyses  is  to  develop  a description  (model)  of  processes  that  would 
account  for  those  data.  In  practice,  rational  and  empirical  analyses  are  rarely 
sharply  separated.  Rational  analyses,  for  example,  may  provide  the  starting  point 
for  empirical  data  collection,  leading  to  an  iterative  process  in  which  successively 
closer  matches  to  human  performance  models  are  made.  Nevertheless,  the 
distinction  is  a useful  one  in  considering  the  kinds  of  investment  in  information- 
processing  analysis  that  will  be  most  valuable  for  instruction. 

In  the  remainder  of  this  chapter,  I consider  information-processing  analyses  of 
several  of  these  kinds.  I describe  first  some  of  our  work  in  rational  process 
analysis,  work  that  war  explicitly  concerned  with  instructional  design  require- 
ments. Next,  1 describe  some  empirical  analyses  of  the  same  kinds  of  relatively 
simple  tasks,  and  consider  the  relationship  between  rational  and  empirical 
analysis  for  instructional  purposes.  In  a final  section,  I consider  the  problem  of 
more  complex  tasks-problcm  solving,  reasoning,  tasks  that  we  use  as  measures 
of  “intelligence”  and  aptitude-and  wliat  the  role  of  formal  simulations  and 
empirically  studied  information  process  models  might  be  for  instruction  in  such 
domains. 


Rational  Task  Analysis  for  Curriculum  Design 

Rational  (ask  analysis  can  be  defined  as  an  attempt  to  specify  processes  or 
procedures  that  would  be  used  in  highly  cfficieni  pc  formance  of  some  ic*k.  The 
result  is  a detailed  description  of  an  "idealized”  performance-one  that  solve; 
the  problem  in  minima)  moves,  does  little  “backtracking,”  makes  few  or  no 
errors.  Typically  a rational  task  analysis  is  derived  from  the  structure  of  the 
subject  matter  and  makes  few  explicit  assumptions  about  the  limitations  of 
human  memory  capacity  or  perceptual  encoding  processes.  In  many  cases 
informal  rational  task  analysis  of  this  kind  can  serve  as  a way  of  prescribing  what 
to  teach  (i.e.,  teach  children  to  perform  the  processes  laid  out  in  the  analyses), 
and  instructional  effectiveness  serves  as  a partial  validation  of  the  analysis. 

In  order  to  convey  the  flavor  and  intent  of  rational  process  analysis  as  applied 
to  instruction,  I will  describe  in  some  detail  part  of  our  own  early  work  on 
simple  arithme'ic  tasks.  This  work  grew  initially  out  of  an  attempt  to  apply 
learning  hierarchy  theory  to  the  problem  of  designing  a preschool  and  kinder- 
garten ma'.iiematics  curriculum.  We  found  it  necessary,  in  order  to  secure 
agreement  among  our  staff  on  the  probable  ordering  of  tasks,  to  miroducc  a 
method  in  which  the  processes  hypothesized  to  be  involved  in  a particular  task 
performance  were  explicitly  hid  out  (see  Resnick.  Wang.  & Kaplan.  1973). 
Figures  4 and  5 show  examples  of  the  analyses  th.it  resulted.  The  top  box  in 
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FIG.  4 Analysis  of  Objective  1-2:C,  “Given  a fixed  ordered  set  of  objects,  the  child  can 
count  the  objects."  (From  "Task  analysis  in  curriculum  design:  A hierarchically  sequenced 
introductory  mathematics  curriculum"  by  L.  B.  Resnick,  M.  C.  Wang,  A J.  Kaplan,  Journal 
of  Applied  Behavior  Analyst*.  1973,  6,  679-710.  Copyright  1973  by  the  Society  for  the 
Experimental  Analysis  of  Behavior.  Reprinted  by  permission.) 

each  figure  shows  the  task  being  analyzed,  the  entry  above  the  line  describing 
the  presented  stimulus  and  the  entry  below  the  line  the  expected  response.  The 
second  row  in  each  figure  shows  a hypothesized  sequence  of  behaviors  engaged 
in  as  the  presented  task  is  performed.  Arrows  indicate  a temporally  organized 
procedure  or  routine.  The  lower  portions  of  the  charts  identify  capabilities  that 
are  thought  to  be  either  necessary  to  performance  (i.e.,  prerequisite  to)  or 
helpful  in  learning  (i.e.,  propadeutic  to)  the  main  task.  The  identified  prerequi- 
site and  propadeutic  tasks  were  used  to  build  hierarchies  of  objectives  that 
formed  the  basis  of  a curriculum. 
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FIG.  5 Analysis  of  Objective  1-2:E,  “Given  a numeral  stated  and  a set  of  objects,  he  child 
can  count  out  a subset  of  stated  size.”  (From  “Task  analysis  in  curriculum  c'esign:  A 
hierarchically  sequenced  introductory  mathematics  curriculum”  by  L.  B.  Resnii  k,  M.  C. 
Wang,  & J.  Kaplan,  Journal  of  Applied  Behavior  Analysis,  1973,  6,  679-710.  Coy  right 
1973  by  the  Society  for  the  Expe.imental  Analysis  of  Behavior.  Reprinted  by  permis..  - ) 

At  the  outset,  the  process  analyses  functioned  for  us  as  aids  in  developing 
prescriptions  for  instruction.  We  carried  out  the  kind  of  research  that  seemed 
most  directly  relevant  to  that  prescriptive  function.  That  is,  we  looked  at  the 
extent  to  which  the  analyses  generated  valid  task  sequences,  sequence:  which 
aided  learning  of  the  most  complex  tasks  in  the  set.  Two  research  strategies  were 
involved.  First,  we  conducted  scaling  studi  s.  In  these  studies,  tests  on  a number 
of  tasks  were  given  to  a sample  of  the  children  prior  to  instruction,  and  the 
results  were  evaluated  for  the  extent  to  which  the  tests  formed  a Guttman  scale 
in  accord  with  the  predicted  prerequisite  relations  (e.g.,  Wang,  1973:  Wang, 
Resnick,  & Boozer,  1971).  A good  approximation  to  a Guttman  scale  implied 
strong  prerequisite  relations  among  the  tasks-relations  that  specified  optimal 
teaching  orders.  A second  set  of  studies  (Caruso  & Resnick,  1971;  Resnick, 
Siegel,  & Kresh,  1971)  involved  more  direct  assessment  of  transfer  relations 
among  small  sets  of  tasks.  Tasks  in  a small  hierarchy  were  taught  in  simple-to- 
complex  and  complex-to-simple  orders.  We  then  looked  at  transfer  effec's  on 
trials  to  criterion  and  related  measures.  These  studies  showed  that  teach,  ; in 
hierarchical  sequence  was  the  best  way  of  assuring  that  most  or  all  o the 
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children  in  a group  learned  all  the  objectives.  For  the  minority  who  were  capable 
of  learning  the  more  complex  objectives  without  intervening  instruction,  how- 
ever, “skipping”  of  prerequisites  was  a faster  way  to  learn.  What  these  children 
apparently  did  was  to  acquire  the  prerequisites  in  the  course  of  learning  the 
more  complex  tasks.  An  important  instructional  question  raised  by  these  results 
is  whether  we  can  match  instructional  strategies  to  individuals’  relative  ability  to 
learn  on  their  own-that  is,  without  going  through  direct  instruction  in  all  of  the 
steps  of  a hierarchy.  Before  we  are  likely  to  answer  that  queJion  well,  however, 
we  will  probably  need  more  systematic  theories  than  we  now  have  available  of 
how  learning  occurs  with  minimal  instruction  (cf.  Resnick  & Glaser,  in  press). 

he  kind  of  task  analysis  used  in  these  studies  served  to  describe  performance 
in  temporally  organized  sequences  and  to  identify  general  information-processing 
abilities,  such  as  perceptual  processing  (e.g.,  Fig.  4, 111c  and  IVc),  memory  (e.g., 
Fig.  5, 11a  and  11c),  and  temporal  synchrony  (e.g.,  Fig.  4, 111a),  that  are  called  on  in 
performing  a specific  complex  task.  As  formal  information-processing  models, 
however,  the  analyses  were  incomplete  because  they  did  not  specify  every  step 
(for  example,  stop  rules  were  not  typically  specified  where  recursive  loops 
occurieri)  nor  did  they  explicitly  deal  with  overall  control  mechanisms  or  total 
memory  load.  In  addition,  they  were  not  empirically  verified  as  process  analyses. 
Although  many  observations  of  performance  were  made,  there  was  no  attempt 
to  match  predicted  or  “ideal"  performance  against  actual  performances.  The 
hierarchy  tests  confirmed  the  validity  of  the  task  sequencing  decisions  made  on 
the  basis  of  the  analyses,  but  they  did  not  necessarily  confirm  the  details  of  the 
analyses.  Performance  strategies  different  from  those  in  our  analyses  might  have 
produced  similar  sequences  of  acquisition  or  transfer  effects.  Thus,  while  the 
scaling  and  transfer  studies  met  instructional  needs  quite  well,  they  did  not 
constitute  validations  of  the  models’  details.  For  this  purpose,  the  strategies  of 
empirical  task  analysis  are  needed. 

Empirical  Analyses  of  Specific  Tasks 

What  can  empirical  analyses  suggest  about  teaching  specific  tasks?  An  obvious 
possibility  is  that  we  might  use  process  models  of  competent  performance  as 
direct  specifications  for  what  to  teach.  Such  models  of  skilled  performance  are 
potentially  powerful  However,  these  alone  do  not  take  into  account  the  capabil- 
ities of  the  learner  as  he  or  she  enters  the  instructional  situation.  1 want  to 
describe  some  experiments  we  heve  done  that  suggest  a more  indirect  relation 
ship  between  what  is  taught  and  how  skilled  performance  proceeds.  The  experi- 
ments suggest  that  what  we  teach  children  and  how  they  perform  a relatively 
iiort  time  after  instruction  are  not  identical-  but  neither  are  they  unrelated 
They  suggest  that  children  seek  simplifying  procedures  that  lead  them  to 
construct,  or  “invent,"  more  efficient  routines  that  might  be  quite  difficult  to 
teach  directly. 
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Subtraction  In  one  study  (Woods,  Resnick,  A Croen,  1975)  we  examined 
simple  subtraction  processes  (e.g.,  5 - 4 = ?)  in  second-  ami  fourth-graders.  The 
method  was  borrowed  from  Groen  and  others’  work  on  simple  addition  pro- 
cesses (Groen  A Parkman,  1972)  and  open-sen  fence  equations  (Groen  A Poll, 
1973).  That  is,  we  gave  children  a set  of  subtraction  problems  to  perform  and 
collected  response  latencies.  Five  possible  models  for  .performing  subtraction 
pioblems  (of  the  form  m — n = ?;  with  0 <1  m < 9, 0 n < 9)  were  hypothesized, 
and  predicted  response  latencies  for  each  problem  for  each  performance  model 
were  worked  out  based  on  the  number  of  steps  that  would  be  required  according 
to  the  model.  Regression  analysis  was  then  used  to  fit  observed  to  predicted 
latency  functions  and  thus  select  the  model  an  individual  child  was  using. 

Of  five  models  tested,  two  accounted  for  the  performance  of  all  but  a few 
subjects: 

Decrementing  model.  Set  a counter  to  m,  decrease  it  n times,  then  ‘'read" 
counter.  For  this  model,  latencies  should  rise  as  a function  of  the  value  of  n.  and 
the  slope  of  the  regression  line  should  reflect  the  speed  of  each  decrementing 
operation.  This  function  is  shown  in  Fig.  6. 


FIG.  6 Plot  of  reaclion  limes  for  second-graders  uifvm*  subtraction  problems  of  ihe  form 
m - ei  ■ Deere  men  tin*  Model  Numbers  bode  solid  dots  de.iole  actual  problems  (e.g.. 
*-54  , 65  signifies  that  problems  5—4  and  6-5  both  had  a mean  tuccrss  la t r ncy  specified  by 
Use  •).  Underlined  problems  were  ormited  in  Use  regression  analysis  (From  "An  experi- 
mental teal  of  five  process  models  for  subtraction"  by  S.  S Woods.  L.  B Resnick.  AG  J 
Groen.  Journal  of  Educational  Psychology.  1975,  67<t>.  tV— 21.  Copyr«ht  1975  by  Use 
American  Psychological  Asvsciabuti  Reprinted  by  permission  ) 
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Choice  model  Depending  on  which  has  fewer  steps,  perform  either  the 
decrementing  routine  (previously  described)  or  another  in  which  a counter  is  set 
10  n and  i«  then  incremented  until  the  counter  reading  matches  m The  number 
of  incrctTR..' . is  then  “read"  as  the  answer  -or  this  model,  it  is  necessary  to 
assume  a process  of  choosing  whether  to  “ir.  umeut  up”  or  “decrement  dowt,. 

We  assume  that  the  choice  process  takes  the  same  amount  of  time  regardless  of 
the  values  of  m and  n.  On  this  assumption,  latencies  should  rise  as  a function  of 
whichever  is  .mailer,  n or  (m  - n).  This  function  is  shown  in  Fig.  7 

Individual  data  were  analyzed  first  ami  a best-fit  model  ulected  for  each  child. 

Then  children  were  grouped  according  to  the  model  they  fit.  and  the  pooled 
data  were  analyzed.  AU  fourth-graders  and  most  second-graders  were  best 
fn  by  the  choice  model  It  seems  unlikely  that  during  their  arithmetic  training 
the  children  had  been  directly  taught  the  choice  model  for  solving  subtraction 
problems  The  procedure  involved  would  be  difficult  to  communicate  verbally  to 
6-  and  7-year  olds,  and  might  confuse  rather  than  enlighten  children  ai  the  point 
of  their  first  exposure  to  subtraction  Most  probably,  the  children  had  been 
taUgh i initially  to  consirud  the  m set  (increment  the  counter  m times),  count 
out  the  n sel  (decrement  n limes),  and  then  count  (“read  out’  ) the  remainder 
This  algorithm  is  close  to  the  one  described  as  the  decrementing  model  The 


FIG  7 Hot  «l  rrjKluvt  nme.  lor  wcund-piden  u’bu'f  uibiiittnei  proNeim  Ounce 
Mo^t  MtN  m m n l reads  "the  urulk  r »( <i  and  •»"  <ii~Ho«ii  An  experiment*!  ic.t  ol 
five  pn**™  model*  to,  rubtrurW  b>  S S »o«dv  L H Remick  k t.  t timer. 
of  IJwrenorm!  *«*>**>  t*75.*7(l>.  17  It  Cupvr>*ht  l*7S  b>  the  Amend*  r»>cho- 
logical  Ahmuiiuk  Reprinted  b>  perrmvuici  I 
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decrementing  model  is  in  fact  derivable  from  the  algorithm  »t  assume  is 
typically  taught,  by  simply  dropping  the  steps  of  constructing  the  m set  and 
actually  counting  the  ret  ander  Thus,  it  seems  reasonable  that  a child  would 
develop  the  decrementing  model  quite  quickly  The  choice  model,  however, 
cannot  oe  derived  from  the  teaching  algorithm  in  so  direct  a way  Instead,  an 
invention  (the  possibility  of  countin'  up  from  n)  must  be  made  This  invention 
is  probablv  based  on  observation  of  the  relations  between  numbers  m addition 
and  subtraction  over  a large  number  of  instances  Yet  the  invention  appears  to 
have  been  ctade  as  early  as  the  end  of  second  grade  b>  most  of  the  children. 

Addition  In  another  study.  Guy  Groen  and  I have  been  looking  more 
directly  at  the  relation  between  the  algorithm  taught  and  bier  performance  In 
the  subtraction  study  we  couid  only  guess  ai  whot  children  had  been  tiughi. 
based  on  our  general  knowledge  of  elementary  school  practice  In  the  addition 
study,  we  coot  -oiled  the  teaching  by  doing  it  ourselves  We  taught  4->  ear  -olds  to 
solve  single-digit  problems  of  the  fom  m ♦ to  * ’ (where  m and  n ranged  from  0 
to  5)  by  using  the  following  algorithm  (a)  count  out  m blocks,  (b)  count  out  n 
blocks,  (r)  combine  the  subsets,  and  id)  count  the  combined  set  We  then 
kept  the  children  coming  back  for  about  two  practice  sessions  i week  for  mans 
weeks.  As  soon  as  each  child  was  performing  the  addition  process  smoothie 
using  blocks,  we  took  the  blocks  away  and  risked  the  children  to  grre  then 
answers  on  a d evict  that  allowed  us  to  collect  latency  data  TV  children' 
typical  response  when  blocks  were  removed  was  to  begm  counting  out  sets  on 
their  lingers  fcventually . however,  most  shifted  to  internal  processing 
fcatlier  vori  by  Suppes  and  Groen  ( 19b  'I  had  shown  that  bv  the  end  of  the 
firs',  grade,  most  children  added  using  a choxe-iype  model  ei  which  thev  set  a 
counter  to  m ot  it.  whichever  was  larger.  n*d  then  incremented  h\  the  smaller  of 
the  iwo  numbers  This  is  known  as  the  mm  jnuimuml  model  j became  the 
latencies  fit  min  (m.  n)|  A few  children  used  a model  of  incrementing  m limes 
then  incrementing  n more  limes,  and  then  reading  the  counter  We  csD  this  the 
sum  model  | latencies  fit  (m  ♦ nil  The  sum  model  can  be  denied  from  the 
procedure  we  taught  by  sinsprs  dropping  steps  tel  and  f /l  of  our  algorithm  and 
it  requires  no  ch-  see  The  mm  model,  hoveiti.  reqwes  an  mvention  bated  on 
the  recognition  that  turns  are  ihe  same  regardless  of  the  order  si  * h «ch  numbers 
are  added,  and  thai  it  is  faster  to  increment  h>  ihe  smaller  quarmtv 
hot  five  of  the  us  children  whose  data  have  been  anah/ed  thus  fat  ii  is  clear 
that  by  the  final  iwo  tesi  sessions  ihe  mm  model  gave  sigmlVant  and  "Ives:  fit 
In  smeral,  the  trend  over  blocks  of  tnals  was  for  subsects  to  be  fit  well  by  the 
min  model  as  toon  as  they  stopped  counting  c eerily  on  must  of  the  tnals  It  ts  as 
if  these  children  discovered  commutativity  as  soon  as  thev  were  confident 
enough  to  stop  counting  on  then  fingers' 

In  the  studies  just  reported  chrfdten  are  taught  a routine  which  rt  derived  Ten 
the  subject  matter  After  some  practice  but  ik»  additional  deect  uvu'mctnes 
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the>  perform  a different  routine,  om  iKji  is  more  effW*«t  The  efftcKtvs  n 4 
muit  of  fnn  steps  , not  app  irenth  (ttln  performance  of  component  opera 
lucti.  which  n turn  rrqutn  s chosce  or  devnaon  on  the  purl  of  the  dirid  A 
stricth  algorithm*.  routine  mi  other  weeds  n couxerted  Mito  Another  rout  nr 
•tuch  turns  out  to  tohe  the  presented  problem  more  effmenth 

A umuar  fndnp  has  been  reported  bs  ttbet  ( 19^2)  m a trads  of  mfo cm* 
uoo-procramj  models  of  class  uvituton  After  hnmj  reversed  t'amng  m the 
pruceun  repealed  bs  jn  nfartmtKn-pocnsnt  suhsts  subjects  ■ere  pc  nrn  ted 
4 tspsvaJ  cbi  tnvhntoct  usi  n which  thrs  were  aslrd  to  led.  foe  mrnffc 
Hhich  is  «aoee  the  red  ones  or  the  trundles’"  Thn  had  been  Uuyfit  to  puss 
through  the  object  urn  truce  each  t me  qumtifiig  the  objects  on  owe  of  the 
drift  Tent  nhc  dtncnuom  runted  and  then  vompunnf  them  to  determine  rthtdi 
»n  more  At  the  post  lest  admuisirred  tmenrduteh  jftet  tn««g  it  mis  found 
that  fume  of  the  children  mere  able  to  perform  the  ta*  more  efficients  bs 
quant  if vrg  on  the  fust  puss  onh  objects  hasrg  onris  one  of  the  danensjons 
ruawi  bs  the  ex  pet  anew  ter  For  example  halier  presented  a subject  with  eght 
trurgies  srtew  of  which  were  red  and  one  preen  Ailed  ~*h*h  is  more  the  red 
ocses  or  the  triagin'  - one  mhjrct  answered  "'There's  one  preen  trunpie  and 
that  if  ties  it  more  trunpies*  Ohahtce  >972.  pp  1 5— 1«»|>  Save  m the  ctasa 
snehruoe  tail  the  set  kmag  onh  one  of  the  aimed  dnsewsicr»,y  n irucdh  thr 
mmor  subset,  thn  procedure  qincih  s «elii  the  answer  It  seems  Uceh  that  a 
phenomenon  of  thn  imj  that  n.  the  ttawdormatt'st  o<  alrorithrm  h the 
inner  n more  fewer*?  tlur  »e  tune  though:  «p  :o  At  least  verse  process 
data  that  appear  ddTW*h  to  aiterpre*  when  averaged  user  taste  mas  dvcw 
auerpeetahae  repuiums  when  ear K aad  Later  phases  of  (effie»aw  are  exam 
esed  separates 

Task  Strietw*  SUW  hrrfoemanee  and  Teach  eg  Root  sn 

9. nut  art  the  eapicatoai  of  Map  of  thn  Laid  lie  nriivtiei’  Or  the  face  ot 
it.  rt  would  teem  that  »e  m^ir  to  abanhm  the  ape  thni.  toutates  vary  cn! 
bs  ntioeul  tail  imS  us  as  tasoe  of  dreecth  iradsg  the  more  errs  t <m»  atalS 
mpesne  pet xe ues  that  appear  to  durctenrc  even  temnldkd  perform***’' 
he  ought  mi  other  wcedv  to  c«**.nde  that  the  ai rival  rativui  anatssev  a- 
wrong,  since  thr , S not  match  sifted  pn*  eauxt  and  th  e rites  d-  rid 
therefore  tot  be  used  mi  nttractvei  Rather  we  ihsril  perform  drtarird  ixget 
ci>  anahses  of  shifted  pertoemaacr  on  a£  ot  the  tasls  that  a cumrulum 
oetgfnev  and  teach  hreth  the  roafnn  wosned  r the  course  of  such 

mh  irs 

Such  a eofi-dusaun,  I bebrte  would  he  mrujwin  It  rests  • -*  the  assumpfw’n 
that  ft  form  munacivm  n nrretianfi  direct  uriractvei  w sidled  perfiemwsce 
strategies  rathe  than  untn*ct*«a  at  rout  uses  that  put  learners  r t p*<J  pvsition 
to  discosrt  or  assent  eflhxent  «rc»ef*es  f<n  tftemsehrv  That  is  what  the  chridrer 
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did  in  the  studies  just  reported.  They  learned  a routine  but  then  invented  a more 
efficient  performance  fo/  themselves.  !t  seems  reasonable  to  suppose— although 
empirical  tests  comparing  different  instructional  strategies  are  needed  tc  draw  a 
strong  conclusion-  that  the  teaching  routines  in  these  studies  were  good  ones, 
because  they  taught  the  specific  skills  in  a way  that  called  upon  children’s 
discovery  md  invention  abilities 

To  put  the  case  m its  most  general  form,  it  would  seem  useful  to  think  in 
terms  of  a "tnangulation"  between  the  structure  of  a task  as  defined  by  the 
subject  matter,  the  performance  of  skilled  individuals  on  a task,  ^nd  a teaching 
acquisition  routine  that  helps  novices  learn  the  task.  There  are  three  terms  in 
this  conceptualization,  all  three  must  stand  in  strong  relation  to  each  of  the 
othen-thus  the  image  of  triangulatian.  This  relationship  is  schematized  in  Fig.  8. 
Most  empirical  information-) processing  analyses  have  been  concerted  with  the 
retailor  ship  between  the  elements  defining  the  base  of  the  triangle-that  is,  with 
the  relanonship  between  the  structure  of  the  subject  matter,  or  “task  environ- 
ment*’ (A),  and  performance  (C).  Thus,  Most  information-processing  task  anal- 
yses are  state  theories,  describing  performance  on  a given  kind  of  task  at  a given 
point  in  learning  or  development,  but  not  attempting  to  account  for  acquisition 
of  the  performance  The  rational  process  analyses  that  we  have  deve'oped  in  the 
course  of  our  instructional  work  have  been  concerned  primai.lv  with  the 
structure  of  the  task  (A)  and  an  idealized  routine  that  represents  the  subject 
matter  well  and  thus  prescribes  a good  teaching  routine  (B).  Our  validation 
studies  have  in  effect  been  tests  of  the  extent  to  which  the  teaching  routines  and 
sequences  derived  through  these  analyses  succeeded  in  conveying  the  subject 


FIG  8 gelations  between  teaching  routines,  performance  routines,  and  structure  of  subject 
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matter  to  learners.  The  discussion  in  the  past  several  pages  has  been  concerned 
with  the  relationships  between  teaching  ioutines(B)  and  performance  routines (C). 
Gaining  understanding  of  the  “transformation”  processes  that  link  these  two 
routines  is  a necessary  step  in  completing  the  triangulation  that  puts  information- 
processing models  into  clear  relation  Jnip  with  instructional  design. 

According  to  this  triangulation”  notion,  there  are  three  criteria  to  be  met  in 
choosing  a teaching  routine: 

1.  It  must  adequately  display  the  underlying  structure  of  the  subject  matter 

2.  It  must  be  easy  to  demonstrate  or  teaeh. 

3.  It  must  be  capable  of  transformation  into  an  efficient  performance  routine 

The  teaching  routine,  then,  is  designed  to  help  facilitate  acquisition.  It  provides 
the  connecting  link  between  the  structure  of  the  subject  matter  and  skilled 
performance- which  is  often  so  elliptical  as  to  obscure  rather  than  reveal  the 
basic  structure  of  the  task. 

Teaching  routines,  in  other  words,  arc  constructed  specifically  to  aid  acquisi- 
tion. The  design  of  teaching  routines  may  require  considerable  artistry,  and  not 
all  routines  will  be  successful  in  meeting  the  criteria  just  laid  out.  Let  us  consider 
*ome  examples.  To  brgin  with  our  own  work,  the  addition  routine  Grocn  and  I 
taught  is  an  insta^ nation  of  the  “union  of  sets”  definition  of  addition.  Thus,  it 
is  a mathematically  “correct"  procedure,  and  represents  the  subject-matter 
structure  clearly.  The  routine  is  also  easy  to  demonstrate  and  to  learn  Our 
4-year-c.d  subjects  (who  knew  only  how  to  count  objects  when  they  began  the 
e, merit nent)  were  performing  addition  virtually  perfectly,  using  the  blocks,  after 
abou.  i Half  hour  of  practice.  The  routine  we  taught  is  awkward  and  slow  to 
perform,  however.  None  of  us  would  like  to  have  to  use  it  in  our  daily  activities, 
and  neither,  apparently,  did  the  4-ycar-olds.  Nevertheless,  the  data  show  that  the 
routine  is  transformable-  by  a series  of  steps  we  ean  imagine  but  cannot  for  the 
moment  document  empirically  -to  the  more  efficient  performance  routine  < 
the  min  model.  Further  this  performance  routine  exemplifies  another  aspect  of 
the  subject-matter  structure,  commutativity.  Thus,  the  proposed  triangulation  is 
completed.  A 'eaehing  routine  derived  by  rational  process  analysis  of  the 
subject-matter  structure  is  transformed  to  a performance  routine  that  reflects  an 
even  more  sophisticated  definition  of  the  subject  matter. 

T ie  ease  is  similar  for  the  subtraction  study.  The  routine  that  we  presume  was 
taught  instantiated  a partitioning-of-sets  definition  of  subtraction.  The  perfor- 
mance routine  derived  by  the  children  is  not  only  more  efficient;  it  also  reflects 
a more  sophisticated  aspect  of  the  subject-matter  structure,  namely  the  complc 
mentary  relationship  betweer,  addition  and  subtraction  operations 
Not  all  teaching  routines  meet  the  criteria  enumerated  above.  Some  are 
nwkward  to  teach;  such  would  be  the  case,  for  example  were  one  to  undertake 
to  teaeh  4-year-olds  the  min  model  for  addition.  Others  fail  to  display  the 
subject-matter  structure  in  a way  that  is  transparent  to  children  This  is  true,  for 
example,  in  the  case  of  traditional  algorithmic  methods  of  teaching  carrying  and 


3.  TASK  ANALYSIS  IN  INSTRUCTION 


75 


borrowing  ihai  do  noi  display  the  underlying  structure  (base  arithmetic  and  ns 
notation)  from  which  the  routines  are  derived. 

Sometimes  instructional  routines  are  developed  in  order  to  display  the  sub- 
ject-matter structure  but  do  not  meet  the  transformability  criterion  -that  is, 
they  arc  not  easily  mapped  onto  a performance  routine  ihat  is  efficient  and 
direct  An  example  of  a performance  routine  that  fails  on  the  criterion  of 
transfonnability  is  one  that  was  proposed  by  Bruner)  1964)  for  leaching  factor- 
ing of  quadratic  expressions  Bruner  was  successful  in  teaching  third-graders  to 
perform  the  facto;  rj  ation  bv  creating  a “model"  of  the  expression  using 
blocks.  As  shown  in  V.\  the  large  square  is  x units  long  and  x wide,  thus 
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TIG  9 (a)  Three  component  for  quadratic  constructions.  (b)  Square*  of  ever  increasing 
size  constructed  with  component.  (From  "Some  iheorems  on  instruction  illustrated  with 
reference  to  mathematics  by  J.  S.  Bruner, in  f.  R.  Ililgard  (Kd.),  Theoritsof  Learning  and 
Instruction  The  63rd  Yearbook  of  the  \SSL  (Pari  1).  Chicago1  Chicago  University  Press, 
1964  C opynght  1964  by  (he  valtonal  Society  for  the  Study  of  Fducatinn.  Reprinted  by 
permission  ) 
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(x2).  The  rod  is  x units  long  and  one  unit  wide,  thus  (x)  The  small  cube  is  7 X 
7,  thus  (7).  As  shown  at  the  right  of  the  figure  (b),  children  can  arrange  these  three 
elements  in  squares  which  will  lave  equal  factors-e  g„  (x  + I)  (or  + I);  (x  + 2) 
(jt  + 2)— and  which  can  also  be  expressed  as  quadratics-e.g.,  (x2  + 2x  + I );  (x2  + 
&x  ♦ 16).  Allowing  children  to  manipulate  the  blocks  may  be  excellent  for 
displaying  and  promoting  insight  into  the  structure  of  the  subject  matter,  but 
there  appears  to  be  no  way  to  transform  the  square-anangement  routine  to  a 
factoring  procedure  used  without  the  blocks. 

Certain  other  leading  routines  in  early  mathematics  do  meet  the  transform- 
ability  criterion  vhile  still  representing  the  mathematical  structure.  For  example, 
measurement  can  be  taught  as  a process  of  dividing  into  equal  units.  Wertheimer 
( 1 959)  did  this  when  he  used  division  of  a figure  into  squares  as  means  of  finding 
its  area.  Bearison  ( 1969),  in  a less  widely  known  experiment,  induced  a general- 
ized conservation  concept  by  showing  children  how  to  count  the  number  of  30-ml 
beakers  that  were  poured  into  beakers  of  different  sizes,  and  demonstrating  the 
principle  of  conservation  by  pouring  equal  quantities  of  liquid  into  containers  of 
different  shapes.  This  generalized  principle  of  measurement,  exemplified  in  the 
liquid  measurement  procedure  taught,  produced  conservation  responses  in  tests 
of  number,  mass,  length,  continuous  and  discontinuous  area,  and  quantity  that 
lasted  for  at  least  six  months.  Similarly,  the  number  base  system  (including 
carrying  and  borrowing)  can  be  taught  using  blocks  in  sizes  of  one,  ten,  and 
one-hundred,  placed  in  units,  tens  and  hundreds  columns  as  in  Fig.  10  (cf. 
Dienes,  1966,  1967a).  With  these  blocks,  carrying  can  be  represented  by  trading 
or  exchanging  extra  (>.e.,  more  than  nine)  blocks  in  a column  for  a larger  block 
that  is  placed  in  the  next  column.  Such  an  exchange  would  be  necessary  for  the 
bottom  display  in  Fig.  10  before  the  block  display  could  be  notated.  A reverse 
exchange  operation  can  be  used  to  represent  subtraction.  In  each  of  'hese  cases, 
as  the  physical  representation  is  dropped,  a performance  routine  can  be  con- 
structed which  initially  performs  "as  if’  the  representation  were  present  and 
then  gradually  becomes  moie  abstracted  from  it.  This  is  the  kind  of  transforma- 
tion we  believe  occurred  in  our  addition  teaching  experiment. 

The  general  suggestion  that  I would  like  to  draw  from  these  observations  is 
that  ',/iost  people-  even  quite  young  children-  use  environmental  feedback  to 
simplify  performance  routines.  They  do  not  accept  the  routines  they  are  shown 
as  "givens"  but  rather  as  starting  points.  They  invent  even  when  we  teach  them 
algorithms.  One  implication  of  this  line  of  thinking  is  that  the  traditional  line 
between  algorithmic  and  inventive  teaching  disappears.  We  are  not  faced  so 
much  with  a choice  between  teaching  by  rules  and  teaching  by  discovery,  as  with 
a problem  of  finding  teaching  rules  that  will  enhance  the  probability  of  dis- 
covery-rules that  somehow  invite  simplification  or  combination  with  other 
rules.  This  way  of  thinking  also  draws  attention  to  the  extent  to  which  we 
presently  depend,  in  our  normal  instructional  practices,  on  this  kind  of  invention 
and  discovery  by  learners.  Our  instruction  is  rarely  complete,  and  rather  than 
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taking  care  to  point  out  the  simplifying  and  organizing  principles  that  underlie 
what  we  teach  we  often  choose  less  than  elegant  instances  and  also  expect 
learners  to  find  the  underlying  principles  for  themselves.  This  suggests  that 
differences  m learning  ability-oflen  expressed  as  intelligence  or  aptitude-may 
" f“'  **  d,ffcrences  ^ 'he  amount  of  support  individuals  require  in  making  the 
simplifying  and  organizing  inventions  that  produce  skilled  performance.  Some 
individuals  vdl  seek  and  find  order  in  the  most  disordered  presentations;  most 
ao  *i.l  ,f  the  presentations  (i.e.,  the  teaching  routines)  are  good  represents 
turns  of  underlying  structures;  still  others  may  need  explicit  help  in  finding 

efficient  strategies  for  performance.  8 


Analyzing  and  Teaching  Generalizes 
Abilities 


'Learning  to  Learn" 


People  apparently  invent  even  within  the  confines  of  algorithmic  instruction 
Nevertheless,  as  just  suggested,  individuals  differ  substantially  in  how  good  they 
are  at  these  inventions.  Thus,  one  appropriate  concern  for  instruction  is  the 
possibility  of  teaching  general  strategies  for  invention  and  discovery-strategies 
that  will  help  learners  to  be  less  dependent  on  the  instructor’s  elegance  in 
presenting  particular  tasks.  An  interest  in  teaching  such  general  “learning  to 
learn  abilities,  as  they  are  often  called,  has  been  widely  expressed  by  educators 
and  psychologists.  However,  few  successes  have  been  reported,  and  there  is  little 
scientific  basis  at  the  present  time  for  such  instruction.  As  in  the  instruction  of 
any  other  ability,  the  first  step  in  teaching  genera!  learning  abilities  is  developing 
a psychological  description-a  task  analysis-of  the  competence  sought.  Such 
analyses  are  only  now  beginning  to  become  available. 

A growing  number  of  information-processing  analyses  of  problem-solving  tasks 
of  various  kinds  provide  a potential  basis  for  instruction.  However  it  is  by  no 
means  evident,  without  further  testing  end  experimentation,  that’ analysis  of 
skilled  per  ormance  on  complex  problems  can  be  directly  translated  into  instruc- 
fional  interventions.  One  test  of  this  possibility  has  been  carried  out  recently  by 
Thomas  Holzman  (1975).  In  an  effort  to  determine  the  instability  of  a 
generalized  pattern  detection  skill,  Holzman  looked  at  an  analysis  of  behavior  on 

^°TTn  t3,SkS  ‘hat  had  bCen  Carr'ed  °Ut  earlier  by  Ko‘°vsky  and  Simon 
fn  h3)'  Tlie.Kot°vsky  and  Simon  analysis  identified  three  principal  subroutines 
r discovering  the  pattern  in  letter  series  completion  tasks  similar  to  those  used 
on  many  intelligence  tests.  These  were  (a)  detecting  the  “period”  of  the 
pattern-that  is,  the  repeating  units  of  a certain  number  of  letters,  such  as  three 
«i  the  pattern  abmcdmefm  ...  or  four  in  the  pattern  defgefghfg ...  (b) 
determining  the  rule  that  generates  each  symbol  in  the  period;  and  (c)  testing  the 
inferred  rule  to  see  if  it  holds  for  all  the  letters  that  have  been  presented  These 
subroutines  in  turn  were  shown  to  be  dependent  upon  recognizing  three  basic 
relations  between  items  in  the  series  presented:  identity  (e.g.,/to  f)\next  in  the 
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alphabet  (e.g.,  f to  g)\  or  backward  next  (e.g.,  h to  g).  These  three  relations 
exhaust  those  that  were  used  in  the  Thurstone  (Thurstone  A Thurstoue,  1941) 
letter  series  completion  task  which  the  Kotovsky  and  Simon  study  used  as  a 
basis,  although  a much  more  extended  and  complex  list  of  relation*,  could  be 
used  in  generating  series  completion  problems. 

Based  on  ibr:  Kotovsky  and  Simon  analysis,  Holzman  taught  children  from 
first  through  sixth  grade  the  strageties  for  recognizing  the  Uiree  basic  relations 
and  for  finding  periods.  Instruction  in  finding  periods  was  (i<.»e  in  such  a way  as 
to  prevent  extrapolation  to  other  subroutines.  Children  trained  iri  these  relations 
and  periodicity  subroutines  improved  significantly  on  the  le*  ter  series  comple- 
tion task  from  pre-  to  posttest.  They  also  improved  significantly  more  than 
control  children  who  simply  took  the  pre-  and  posttest  and  did  not  practice  the 
series  completion  task.  Comparisons  of  particular  types  of  errors  for  the  training 
and  control  groups  showed  that  the  trained  children  improved  significantly  more 
than  the  controls  on  the  more  difficult  relations  (e.g , next  as  opposed  to 
identity ) and  on  the  generally  more  difficult  problems.  Control  children  showed 
a practice  effect,  due  to  experience  with  the  test  itself,  which  was  limited  largely 
to  improvement  on  the  most  easily  detectable  relation  (i.e.,  identity).  This  study 
suggests  that  as  information-processing  analyses  succeed  in  identifying  the 
processes  underlying  problem  solution,  these  processes  -at  least  some  of 
them-  can  be  directly  taught,  and  that  individuals  will  then  be  able  to  apply 
them  to  solving  relatively  large  classes  of  problems. 

What  possibilities  exist  for  analyses  of  problem-solving  abilities  that  are  even 
more  general  than  those  Holzman  found,  and  what  might  these  yield  as  a basis 
for  instruction  that  would  be  truly  generative  of  learning-to  learn  abilities? 
Robert  Glaser  and  I have  considered  this  question  in  another  volume  (Resnick  A 
Glaser,  in  press)  in  which  we  described  several  studies  of  invention  behavior  in 
mathematics  and  related  tasks.  We  argued  that  the  processes  involved  in  problem 
solving  of  certain  kinds  were  probably  the  same  ones  involved  in  learning  in  the 
absence  of  direct  or  complete  instruction,  and  that  instruction  in  those  process 
might  constitute  a means  of  increasing  an  individual's  intelligence. 

A model  of  problem  solving  was  developed  in  which  three  interacting  phases 
were  identified:  (1)  problem  detection,  in  which  the  inapplicability  of  “usual 
routines”  is  noted  and  a problem  or  goal  formulated;  (2)  feature  detection,  in 
which  the  task  environment  (the  external  situation,  which  includes  both  physical 
and  social  features)  is  scanned  for  cues  to  appropriate  responses,  and  (3)  goal 
analysis,  in  which  goals  are  successively  reformulated,  partly  on  the  basis  of 
external  task  cues,  in  order  to  yield  soluble  subgoais  that  contribute  eventually 
to  solution  of  the  task  as  presented.  A study  by  Schadierand  Pellegrino  (1974) 
has  shown  that  requiring  the  subject  to  verbalize  the  goals  of  the  problem  and 
his  or  her  strategies  for  solving  it  before  making  overt  moves  toward  solution 
greatly  enhances  the  likelihood  of  invention.  Along  similar  lines,  it  seems  likely 
that  ways  can  be  found  to  make  individuals  more  conscious  of  the  role  of 
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environmental  cues  in  problem  solving  and  to  tea«.h  strategies  of  feature  ictnnirv 
and  analysis.  Thu  instruction  should  < nhance  the  li^lihood  of  their  noticing  cues 
that  prompt  effective  actions  while  recognizing  and  somehow  “deactivating''  those 
that  prompt  ineffective  actions  Extending  this  ^enereJ  argument  of  self- 
regulation  as  a major  characteristic  of  successful  learning  and  problem 
solving.  Resnick  and  Beck  (in  press)  have  suggested  that  a similar  form  of 
instruction  in  conscious  use  of  self-questioning  and  self-monitoring  strategies 
might  be  an  effective  way  of  enhancing  reading  comprehension  abilities. 

The  specific  suggestions  that  can  be  offered  at  this  time  for  instruction  of 
generalized  learning  abilities  are  limited,  since  relatively  little  work  has  been 
done  thus  far  on  developing  task  analyses  that  characterize  these  general  pro- 
cesses in  mstructable  terms  Rational  analysis  seems  less  likely  to  yield  good 
suggestions  for  generalized  abilities  than  for  specific  tasks,  thus  empmcal  task 
analyses  seem  to  be  called  for  further  the  rigot  of  formal  smrilation  models 
seems  especially  important  wnere  the  processes  are  little  understood  and  the  task 
cnvmv  merits  loosely  structured,  as  is  often  the  case  wheie  problem  solving  and 
discovers-  are  called  for  Thus,  with  respect  to  this  most  unport  ant  goal  of 
instruction,  it  will  probabh  be  necessary  to  engage  in  the  most  costlv  and 
extended  forms  of  task  analysis,  that  is.  those  that  are  formalls  stated  and 
empmcaU)  validated  To  the  extent  that  the  analyses  identify  mstructable 
processes,  instructional  experiments  can  serve  as  one  of  the  man*  forms  of 
empiric:  I validation  of  the  performance  models  proposed  A mutual  interaction 
between  scientific  and  insmictioral  esmeerns  can  thus  be  envisaged  It  is.  then 
with  respect  to  these  general  abilities  in  learning,  thmkmc  and  problem  sob  mg 
that  information -processing  anaksts  mas  ultimaieb  have  the  most  i„  offer  to 
instruction,  and  instructional  effoits  to  psv  choksgical  knowledge 
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INTRODUCTION 

One  can™, » help  but  queUam  the  s*n,fican«  of  fmchoiop’,  contribution 
the  development  of  effect  nr  mUn.ct.onal  procedure!  On  the  one  hand  ptv 
choJop  has  beet,  vrrv  influential  m the  Held  of  educate  In  the  lau  25  vears 
Wmost  eser>  major  innovation  in  educanon  programmed  textbooks.  behavioral 
oVctnrv.  ungraded  schools.  mdrvsduallv  presented  mur uclam.  compote,  man 
aped  and  aimted  inutucnon.  token  economies,  and  tailored  irumf  to  name  a 
fes»  can  be  traced  to  ptvchoiogv  In  nuns  cases  these  n novations  have  rwt 
been  due  to  psvehoftopus  primarrfv  identified  .ith  educatasn  but  rathe,  to 
LsSoratorv  soentius  .hose  research  has  su^eued  ne.  approaches  to  msiroc 
lt<'n  cjn  **  P*1**5  l hat  recxsrd  of  accomplishment  Bur  upon 

dose,  esammatam.  it  « evident  that  these  accxunphshments  are  not  ,»  ckvseh 
hnked  to  psichok^cal  research  as  man,  m*h.  belanr  Psvcholops  has  sup 
ers'ed  ne.  approaches  to  rducafaw  but  these  luppeuaxis  have  not  led  to 
sustained  research  prop, arm  that  have  the  promise  of  prcvjucm*  , truh  effective 
theon  of  muruciam  Rather  psvcholopv  seems  to  provide  the  stunuhn  for 
mnovation.  but  mnovatxm  that  has  not  m turn  led  to  a deeper  undc-siandr*  of 
the  leammp  pr  vrss 

»h>  has  psvchulcpv  not  had  i rtvrr  subuantul  mpuct’  There  are  several 
rexsonr  The  br*htesJ  and  aNest  sounp  psv  cholopw  s usuaFv  are  not  attracted 
educataesal  research  and  the  research  that  has  been  done  tends  to  be 
ptcemeal.  not  purvrmp  problems  m *eal  depth  Vfm  pKture  ma>  chanpe  r,  the 
war  future  due  to  the  limited  number  of  jobs  for  ne.  Ph  I)  * Jnd  to  «x*tv  s 

w*Jr**'  *,r'fc‘,,0•  l 
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mcreastng  emphasis  on  applied  research  The  more  serioui  problem  however  n 
that  pndwkiptti  know  a peat  d>  J about  the  acquisition  of  individual  fact*  and 
siiH*.  but  very  little  about  how  these  combine  to  form  a meaningful  mental 
structure  bffectivt  methods  for  acquiring  skills  and  facts  are  vnportant,  but  the 
mafor  problem  is  the  development  of  knowledge  structures  that  are  more  than 
the  sum  of  individual  facts  In  order  to  deal  effectives  with  educational 
problems,  we  need  theories  that  tell  us  how  knowledge  is  represented  in 
memory,  how  information  is  retrieved  from  that  knowledge  structure,  how  new 
informal n*n  is  added  to  the  structure,  and  how  the  v. stem  can  expand  that 
knowledge  structure  by  seffgeneritnr  processes  The  development  of  such 
theories  is  under  way.  and  tncreasvtgh  work  in  cognitive  psychologs  is  movmg 
■1  that  direction  The  contributtons  o!  Anderson  and  Bower  ( |«>7 3 ».  Newell  and 
Sunon  ( I972f  Kumeihart  Lmdus  . and  Norman  ( |9?2).  and  Schjnk  ( 19^21  are 
examples  of  substantial  efforts  to  develop  comprebensise  theories  of  cognition, 
and  it  is  a Ire  ad  evident  that  thrs  uork  wfl  have  enpbcatiom  for  education 
Such  theorui  will  not  lampts  add  another  urutkle  to  educational  research,  but 
•dl  lay  the  found.*! ions  for  research  encompassing  a larger  ret  of  educationalh 
significant  problems  than  has  been  considered  >i  the  para 

In  this  paper  I w an  I to  review  the  ongomg  work  in  ms  laboratory  that  has 
■optical for  irts  ruction  Some  of  that  work  represents  attempts  to  deal  with 
the  issue  of  complex  knowledge  structures,  whereas  some  is  more  resirictnc 
dealmg  with  the  acquisition  of  specific  tiffs  and  facts  All  o'  the  work  urvobes 
computet -based  programs  of  mslrwction  used  on  a dads  basis  m schsvih  and 
coflepes.  These  programs  can  best  be  described  is  aJtptnr  mirrut  r>  «o»<  nurmt 
8s  that  term  I mean  two  thmgs  1 1 1 the  trqwr*.r  of  nst  rue  larval  actions  taken 
b>  the  program  *anrs  as  a function  of  a given  student  s performance  htstors  and 
I2l  the  program  is  organised  to  wdifs  itieft  automatical^  as  more  students 
complete  the  course  and  then  response  records  idertd s defects  n muructiofui 
riitrpn 

Oa*  work  ou  adaptive  •mtrwctawvW  systems  has  three  fevt  One  n the  develop 
■em  of  a course  m computer  programme*  for  tumor  coAegr  and  college 
ttudrMs  the  secxuid  is  a cvwinc  for  leaching  r adng  n the  fust  three  grades  of 
eirmcMarv  K hoofs.  and  the  thud  » a fore*? -language  vsvabutar  < program  bemg 
■ted  at  the  codrpe  level  llete  I wdl  review  research  on  each  of  there  projects 


INtSTRUCTKX  tN  COWTUTER  PROGRAMMING 

Our  fust  rffetit  ro  leach  ccwpim  ptipumui  •veered  rhe  devekpmrej  of  j 
awpurruMtf  erurwcim  if  All  sumfm  to  teach  the  AllJ  t Aigrtrix 
lih'pttM  (khw**  prsgrimrvug  ■ w •«*«  oune  has  been  red  eaten 
*****  r - •***  '»°*t  cvdkpin  as  » ntridsctoe  to  attguter  pr  grisn 

*amgl8eafd  Ice  toe  VeV  A Afkmtre  |9'l|  livwerer . it  is  a Irsear  Tfrune 
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oncnlcd"  CAI  program  and  does  not  provide  tndrs  dualized  u.ttniction  during 
the  problem-solving  activit)  itself  After  work .ing  through  Icivm  segments  on 
nniu,  exptessaons  etc  , the  Uudent  is  a ssgncvl  a problem  to  sohe  m AID  lie 
mast  then  leave  the  instruct kmijI  program  call  up  a senataie  AID  interpreter 
perform  the  tcqutrrd  programming  talk,  and  return  to  the  imtructwrul  prof  ram 
with  an  answer  Ai  the  Undent  wntei  a program  with  AID.  the  onl>  sources  of 
assistance  are  the  error  mettles  provided  b>  the  nonmstructional  interpretet 
An  inadequao  of  the  AID  course  espeoallv  for  research  purposes,  is  its 
limited  abditv  to  characterize  irvln  dual  uudents'  knowledge  of  specific  skills, 
and  its  mabdits  to  relate  students'  skills  to  the  curricuhim  as  ans thing  more  than 
a ratio  of  problems  correct  10  problems  attempted  The  program  cannot  make 
line  distinctions  between  a Uudent  s urengths  and  weaknesses  and  cannot 
present  muructiorul  material  specifkalls  appropriate  to  that  student  be>ond 
“harder'  or  “easier  lr»onv  In  order  to  expk*r  the  effects  of  different 
curriculum  selection  urategies  w more  detail  we  developed  another  introduc 
tors  programming  course  capable  of  rrpfcsrntaif  both  its  subject  nuttei  and 
student  performance  more  adequate!*  The  internal  lepresenlatson  of  program 
nwng  skilh  and  their  relatiomhgn  ro  the  curriculum  n similar  in  some  *a>s  to 
the  semantic  networks  used  m the  “generative  CAI  programs  developed  bs 
CarKusetl  and  others  1C  uboneil.  |9?0.  Colmv  ( atbonel!.  A Barnock  l*»73» 

T>s*  BASIC  Instruct *on*i  Program 

An  uspoetant  feature  of  a luiorial  CAI  program  n to  provwie  assrstarsce  as  the 
student  attempts  to  solve  a problem  The  program  must  conuai  a repeesentatam 
of  the  subject  matter  that  n complex  enough  to  alow  the  program  to  generate 
appropriate  asuuancr  ai  ans  stage  of  the  under, t s sotatwin  attempt  The  BASK 
iBegmnrrs  AS-p-rp>ne  Ssnvbobo  IwmMiw  Code!  Insinactamal  ProgramtBIPl 
eve  tarns  a representation  of  uifoematam  appeopeute  to  the  tcachag  of  com 
purer  prgramrarag  that  aftows  the  program  both  to  pcovaie  help  to  the  student 
and  to  perform  a imuted  Kit  adevjaate  anahus  of  the  correctness  of  the 
student  s program  as  a schtaw  to  the  grvers  problem 
To  the  student  seated  at  a urmmai  BIP  kv*s  sen  ms«ch  I if  a tvpxal 
tune  sharmg  BASK  opetatmj  svuem  The  BASK  rstetpeetet  written  rspecutv 
for  BIP  anahres  each  pevgraas  brve  after  the  student  tapes  it  arvl  r*>t«f«n  the 
student  of  svetax  etroex  •Ve  the  student  nms  his  or  her  program  it  is  checked 
for  urweturaf  legalities.  aad  dwrwg  ruutarve  executiow"  errors  are  rsivated  A 
fit  s*  rgr  sviarm.  a akxinoi.  ml  utdrts  cvrvuwh  are  nakaNr 
Rrwdmg  above  the  msUted  operate*  tv  stem  n the  “Tutor.-  or  rruructinvil 
peg jus  I IP  I It  ovetVvAs  the  enter  student  BIP  dsakgue  ami  motivates  the 
■vslrwctoeial  ntttx‘t>«  l»  ailrm  to  selecting  i*l  pet  seeing  programme* 
pe  -Merns  to  the  stwdmt  the  IP  vdmurms  the  uudent  s peoblrw  areas,  ogpests 
"stibeaskv.”  fives  ham  or  rrwsdH  cAit.nss  whrst  nreraurt  offers  debug 
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png  anlv  and  supplies  incidental  instruction  in  the  lorm  >1  messages,  interactive 
lessons,  or  manual  references 

At  the  core  of  BIP  is  an  information  network  whose  nodes  are  concepts,  skills, 
problems,  sul-problems.  prerequisites.  BASIC  commands,  remedial  lessons,  hu  ts, 
and  manual  references  The  network  is  used  to  characterize  both  the  logical 
structure  of  the  course  and  our  estimate  of  the  student's  current  slate  of 
knowledge  more  will  be  said  about  the  network  later  bigute  I illustrates  the 
inieractKins  of  the  parts  of  the  BIP  program 

The  ci*’  iculum  is  organized  as  a set  of  programming  problems  whose  text 
includes  onh'  the  description  of  the  problem,  not  lengthy  descriptors  ct 
programming  structures  or  explanations  of  sxntis  There  is  no  fixed  or  iei"g  •.;! 
the  tasks,  the  decision  to  move  from  one  task  to  another  is  made  on  .he  basis  of 
the  information  about  the  tasks  (skills  involved,  pren*quisi’es.  subtask  available! 
stored  in  BIPs  network 

A student  progresses  through  the  cursculum  b*  writing.  ;rv'  t :nn;ng  a 
program  that  solves  the  problem  presented  on  the  terminal  VimiuIIs  ...  iantta 
tions  are  imposed  on  the  amount  of  tune  the  student  sper-1*  the  ivjmbei  of  ! nes 
he  writes,  the  number  of  errors  h*  ts  allowed  to  nuVe  the  number  of  ’irv*-:.he 
chiioses  to  execute  the  program,  etc  The  task  on  wbicfi  .he  stud  . o 

sioed  on  a stack  hk  structure  so  that  he  ma>  cork  oc  n<  thei  toil  for 
whatever  reason  and  return  to  the  previous  task  automancallv  The  omAuforr 
structure  can  accommodate  a wide  va  *tv  of  student  iptitrdr.  ar  d sUdh  Most 
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of  the  curriculum-related  options  are  designed  with  the  less  competent  student 
in  mind  A more  competent  student  may  simply  ignore  the  options.  Thus,  BIP 
gives  students  the  opportunity  to  determine  their  own  "challenge  levels"  by 
making  assistance  available  but  not  inevitable. 

BIP  offers  the  student  considerable  flexibility  in  m.tking  task -related  decisions. 
The  student  may  ask  for  hints  and  subtasks  to  help  solve  the  given  problem,  or 
may  ponder  the  problem,  using  only  the  manual  for  additional  information.  The 
student  may  request  a different  task  by  name  either  completing  the  new  task  or 
not,  as  he  or  she  chooses.  On  the  student’s  return  to  the  original  task,  BIP  tells 
him  or  her  the  name  of  the  again-current  task,  and  prints  the  text  of  the  task  if 
requested.  The  student  may  request  the  model  solution  for  any  task  at  any  lime, 
but  BIP  will  not  print  the  model  for  the  current  task  unless  the  student  has 
exhausted  the  available  hints  and  subtasks.  Taken  together,  the  curriculum 
options  allow  for  a wide  range  of  student  preferences  and  behaviors. 


Th«  Information  Network  of  BIP 

Tnsk  selection,  remedial  assistance,  and  problem  area  determination  require  that 
the  program  have  a flexible  information  store  interrelating  tasks,  hints,  manual 
references,  etc.  This  store  has  been  built  usit'g  the  associative  language  LEAP,  a 
SAIL  (Stanford  Artificial  Intelligence  Laboratory)  sublanguage,  in  which  set,  list 
and  ordered  triple  data  structures  are  available  (Feldman,  Low,  Swinehart,  & 
Taylor,  1972,  Swinhart  & Sproull,  1971;  Vanuehn,  1973).  Figure  2 presents  a 


FIG  2 A legmen t of  BIP  information  network. 
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amplified  relationship  among  a few  programming  concepts,  specific  observable 
skills  that  characterize  the  acquisition  of  the  concepts,  and  programming  prob- 
lems that  require  the  use  of  those  skills.  The  network  is  constructed  using  the 
associative  triple  structure,  and  is  best  described  in  terms  of  the  various  ‘ypesof 
nodes: 


TASKS 


SKILLS 


CONCEPTS: 


OPERATORS: 


HINTS: 


ERRORS 


All  curriculum  elements  exist  as  task  nodes  in  the  network 
They  are  linked  to  each  other  as  subtasks,  prerequisite  tasks, 
or  ‘‘must  follow"  tasks. 

The  skill  nodes  are  intermediaries  between  the  concept  nodes 
and  the  task  nodes  (Fig.  2).  Skills  are  very  specific,  e.g  , 
"concatenating  string  variables”  or  “incrementing  a counter 
variable."  By  evaluating  success  on  the  individual  skills,  the 
program  estimates  competence  levels  in  the  concept  areas.  In 
the  network,  skills  are  related  to  the  tasks  that  require  them 
and  to  the  concepts  that  embody  them. 

The  principal  concept  areas  covered  by  BIP  are  the  following 
interactive  programs;  variables  and  literals;  expressions;  input 
and  output;  program  control- branching;  repetition- loops; 
debugging;  subroutines;  and  arrays. 

Each  BASIC  operation  (PRINT,  LET, . . . ) is  a node  in  the 
network.  The  operations  are  linked  to  the  tasks  in  two  ways: 
either  as  elements  that  must  be  used  in  the  solution  of  the 
problem,  or  as  those  that  must  not  be  used  in  the  solution. 

The  hint  nodes  are  linked  to  the  tasks  for  which  they  may  be 
helpful.  Each  time  a new  skill,  concept  or  BASIC  operator  is 
introduced,  there  is  an  extra  hint  that  gives  a suitable  manual 
reference. 

All  discoverable  syntax,  structural,  and  execution  errors  exist 
as  nodes  in  the  network,  linked  to  the  relevant  “help”  mes- 
sages, manual  references  and  remedial  lessons. 


Clearly  in  some  cases,  a hierarchy  among  skills  or  problems  is  implicit;  more 
frequently,  however,  such  a relationship  cannot  be  ass’  med.  By  imposing  only  a 
very  loose  hierarchy  (e.g.,  requiring  that  all  students  Degin  the  course  with  the 
same  problem),  it  is  possible  to  select  curriculum  an„  provide  assistance  on  the 
basis  of  a student’s  demonstrated  competence  level  on  specific  skills,  rather  than 
on  the  basis  of  a predetermined,  nonindividualized,  sequence  of  problems. 
Students  who  acquire  competence  in  skills  in  some  manner  other  than  that 
assumed  by  subject-matter  experts  to  be  standard  should  benefit  most  from  this 
potential  for  individualization. 

Upon  completion  of  a task,  the  student  is  given  a “post  task  interview”  in 
which  BIP  presents  the  model  solution  stored  for  that  problem.  The  student  is 
encouraged  to  regard  the  model  as  only  one  of  many  possible  solutions  BIP  asks 
the  student  whether  he  or  she  has  solved  the  problem,  then  asks  (for  each  of  the 
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skills  associated  with  the  task)  whether  more  practice  is  needed  on  that  skill  In 
addition  to  the  information  gained  from  this  student  self  analysis,  BIP  also  stores 
the  result  of  a comparison  between  the  student’s  program  and  the  model 
solution,  based  on  the  output  of  both  programs  when  run  on  a set  of  test  data. 
The  student's  responses  to  the  interview  and  the  results  of  the  program  compari- 
son are  used  in  future  BIP-generated  curriculum  decisions.  BIP  informs  the 
student  that  the  task  has  been  completed,  and  either  allows  the  rtudent  to  select 
the  next  task  by  name  (from  an  off-line  printed  list  of  names  and  problem 
texts),  or  makes  the  selection  for  the  student. 

An  example  of  the  role  of  the  1 iformation  Network  in  BIPs  tutorial  capabil- 
ities is  the  BIP-generated  curriculum  decisions  mentioned  above  By  storing  the 
student’s  own  evaluation  o.  his  or  her  skills,  and  by  comparing  the  student’s 
solution  attempts  to  the  stored  models,  BIP  can  be  said  to  “learn"  about  each 
student  as  an  individual  who  has  attained  a certain  level  of  competence  in  the 
skills  associated  with  each  task.  For  example.  BIP  might  have  recorded  the  fact 
that  a given  student  had  demonstrated  competence  (and  confidence)  in  the  skill 
of  assigning  a literal  value  to  a variable  (e  g..  A'  = I),  but  had  failed  to  master  the 
skill  of  incrementing  a counter  variable  (e.g„  A'  = A'+  I ).  BIP  can  then  search  the 
network  to  locate  the  skills  that  are  appropriate  to  each  student's  abilities  and 
present  tasks  that  incorporate  those  skills.  The  network  provides  the  bas  from 
which  BIP  can  generate  decisions  that  take  into  account  both  the  subject  matter 
and  the  student,  behaving  somewhat  like  a human  tutor  in  preseming  material 
that  either  corrects  specific  weaknesses  or  challenges  and  extends  particular 
strengths,  proceeding  into  as  yet  unencountered  areas 
The  BIP  program  has  been  running  successfully  ’vith  both  junior  college  and 
university  students.  However,  the  program  is  still  my  much  in  an  experimental 
stage  From  a psychological  viewpoint,  the  principal  research  issues  deal  with  ( I ) 
procedur  s for  obtaining  on-line  estimates  of  stu.lent  abilities  as  represented  in 
the  information  network,  and  (2)  alternative  methods  for  using  the  current 
estimates  in  the  information  network  to  make  instructional  decisions  Neither  of 
these  issues  is  restricted  to  this  particular  course,  and  a major  goal  in  the 
development  of  BIP  is  to  provide  an  instructional  model  suitable  to  a variety  of 
different  subject  areas  Two  topics  must  be  discussed  in  relation  to  this  goal  the 
naiure  of  appropriate  subject  areas  and  the  general  characteristics  of  the  BIP-like 
stiucture  that  make  it  particularly  useful  in  teaching  such  subjects 
A subject  well  suited  to  this  approach  generally  fits  the  following  description 
it  has  clearly  definable,  demonstrable  skills,  whose  relationships  are  well  known, 
the  real  content  of  the  subject  matter  is  of  a problem-wiving,  rather  than  a 
fact  acquiring,  nature,  the  problems  presented  to  the  student  involve  overlapping 
sets  of  skills,  and  a student’s  solution  to  a given  problem  can  be  judged  as 
adequate  or  inadequate  with  some  degree  of  confidence.  The  BASIC  language,  as 
taught  bv  BIP  is  one  such  subject,  but  the  range  of  appropriate  curriculuins  goes 
well  bevond  the  area  of  computer  science  Foi  example,  elemental  st?.istics 
could  be  taught  by  a similar  approach,  as  could  algebra,  navigation,  accounting. 
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or  organic  chemistry.  All  these  subject  areas  involve  the  manipulation  of  infor- 
mation by  the  student  toward  a known  goal,  all  involve  processes  that  can  be 
carried  out  or  simulated  by  a computer,  and  all  are  based  on  a body  of  skills 
whose  acquisition  by  the  student  can  be  measured  with  an  acceptable  degree  of 
accuracy. 

Because  they  require  the  development  ot  problem-solving  skills,  rather  than 
tire  memorization  of  facts,  these  subject  areas  are  frequently  difficult  to  master 
and  difficult  to  tutor,  especially  using  standard  CAI  techniques.  One  limitation 
of  such  standard  techniques  is  their  dependence  on  a “right”  answer  to  a given 
question  or  problem,  which  precludes  active  student  participation  in  a problem- 
solving process  consisting  of  many  steps,  none  of  which  can  be  evaluated  as 
correct  or  incorrect  except  within  the  context  of  the  solution  as  a whole,  in 
addition,  standard  CAI  techniques  usually  consist  of  an  instructional  facility 
alone-a  mechanism  by  which  information  is  presented  and  responses  are  judged. 
This  facility  can  be  linked  to  a true  problem-solving  facility  that  allows  the 
student  to  proceed  through  the  steps  to  a solution,  but  the  link  does  not  allow 
the  transfer  of  information  between  the  instructional  and  the  problem-solving 
portions  of  the  program.  The  complete  integration  of  the  two  pans  is  ? key 
feature  of  BIP,  making  it  a priate  to  instruction  in  subject  areas  that  have 
been  inadequately  treated  in  caI. 

The  most  general  characteristics  of  the  “network”  structure  include  a represen- 
tation of  the  curriculum  in  terms  of  the  specific  skills  required  in  its  mastery  and 
a representation  of  the  student’s  current  levels  of  competence  in  each  of  the 
skills  he  has  beer,  required  to  use.  Individual  record-keeping  relates  each  stu- 
dent’s progress  to  the  curriculum  at  all  tunes,  and  any  number  of  schemes  may 
be  used  to  apply  that  relationship  to  the  selection  of  tasks  or  the  presentation  of 

additional  information,  hints,  advice,  etc. 

An  important  element  of  our  network  structure  is  the  absence  of  an  estab- 
lished path  through  the  curriculum,  providing  the  built-in  flexibility  (like  that  of 
a human  tutor)  to  respond  to  individual  students’  strengths  and  weaknesses  as 
each  student  works  with  the  course.  This  can  only  be  accomplished  through  a 
careful  analysis  and  precise  specification  of  the  skills  inherent  in  the  subject 
matter,  the  construction  of  a thorough  curriculum  providing  in-depth  experience 
with  all  the  skills,  and  a structure  of  associations  among  elements  of  the 
curriculum  that  allows  for  the  implementation  of  various  instructional  strategies. 
Instructional  flexibility  is  complemented  by  research  flexibility  in  such  a struc- 
ture, because  the  nature  of  the  associations  can  be  modified  for  different 
experimental  purposes.  Once  the  elements  of  the  network  have  been  established, 
it  is  easy,  for  example,  to  change  the  prerequisite  relationship  between  two 
problems,  or  to  specify  a higher  level  of  competence  in  a given  skill  as  a criterion 
measure. 

The  considerable  complexity  involved  in  programming  this  kind  of  flexible 
structure  imposes  a certain  limitation.  Standard  CAI  “author  languages  are  not 
appropriate  to  this  network  approach,  and  constructing  a CAI  course  on  BIP  s 
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pattern  is  not  a task  tc  be  undertaken  by  the  educator  (or  researcher)  who  has 
no  programming  support.  The  usefulness  of  author  languages  is  their  simplicity, 
which  allows  subject-matter  experts  to  prepare  course  material  relatively  quickly 
and  easily.  Most  author  languages  provide  for  alternative  paths  through  a 
curriculum,  for  alternative  answer-matching  schemes,  and  so  forth;  considerable 
complexity  is  certainly  possible.  However,  the  limits,  once  reached,  are  real,  and 
the  author  simply  cannot  expand  the  sophistication  of  his  course  beyond  those 
limits. 

The  programming  support  required  by  the  network  approach,  on  the  other 
hand,  implies  (1)  the  use  of  a general,  powerful  language  allowing  access  to  all 
the  capabilities  of  the  computer  itself,  and  (2)  a programming  group  with  the 
training  and  experience  to  make  full  use  of  the  machine.  It  has  been  our 
experience  that  the  flexibility  of  a general  purpose  language,  while  expensive  in  a 
number  of  ways,  is  worth  the  costs  by  virtue  of  the  much  greater  freedom  it 
allows  in  the  construction  of  the  curriculum  and  the  implementation  of  experi- 
mental conditions.  For  a more  complete  description  of  BIP  and  a review  of  our 
plans  for  further  research  see  Barr,  Beard,  and  Atkinson  (1974). 


INSTRUCTION  IN  INITIAL  READING 
(GRADES  1-3) 

Our  first  efforts  to  teach  reading  under  computer  control  were  aimed  at  a total 
curriculum  mat  would  be  virtually  independent  of  the  classroom  teacher  (Atkin- 
son, 1968).  These  early  efforts  proved  reasonably  successful,  but  it  soon  became 
apparent  that  the  cost  of  such  a program  would  be  prohibitive  if  applied  on  a 
large-scale  basis.  Further,  it  was  demonstrated  that  some  aspects  of  instruction 
could  be  done  very  effectively  using  a computer,  but  that  there  were  other  tasks 
for  which  the  computer  did  not  have  any  advantages  over  classroom  teaching. 
Thus,  during  the  last  fair  years,  our  orientation  has  changed  and  the  goal  now  is 
to  develop  low-cost  CAI  that  supplements  classroom  teaching  and  concentrates 
on  those  tasks  in  which  individualization  is  critically  important.2 

Reading  Curriculum 

Reading  instruction  can  be  divided  into  two  areas  which  have  been  referred  to  as 
“decoding”  and  “communication.”  Decoding  is  the  rapid,  if  not  automatic, 
association  of  phonemes  or  phoneme  groups  with  their  respective  graphic  repre- 

1 A student  terminal  in  the  current  program  consists  only  of  a Model-33  teletypewriter 
with  an  audio  headset.  There  is  no  graphic  or  photographic  capability  at  the  student 
terminal  as  there  was  in  our  first  system,  and  the  character  set  of  the  teletypewriter  includes 
only  uppercase  letters.  On  the  other  hand,  the  audio  system  is  extremely  fcxible  and 
provides  virtually  instantaneous  access  to  any  one  of  6,000  recorded  words  and  messages. 
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FIG.  3 Schematic  presentation  of  the  strand  stricture.  Entry  into  each  strand  depends  on  a 
student's  performance  in  earlier  strands.  The  vertical  dotted  lines  represent  maximal  rate 
contours  which  control  the  student’s  progress  in  each  strand  relative  to  the  other  strands. 


sentations.  Communication  involves  reading  for  meaning,  aesthetic  enjoyment, 
emphasis,  and  the  like.  Our  CAI  program  provides  instruction  in  both  types  of 
tasks,  but  focuses  primarily  on  decoding.  The  program  is  divided  into  eight  parts 
or  strands.  As  indicated  in  Fig.  3,  entry  into  a strand  is  determined  by  the 
student’s  level  of  achievement  in  the  other  strands.  Instruction  begins  in  Strand 
0,  which  teaches  the  skills  required  to  interact  with  the  program.  Entry  into  the 
other  strands  is  dependent  on  the  student’s  performance  in  earlier  strands.  For 
example,  the  letter  identification  strand  starts  with  a subset  of  letters  used  in  the 
earliest  sight  words.  When  a student  reaches  a point  in  the  letter  identification 
strand  where  he  has  exhibited  mastery  over  the  letters  used  in  the  first  words  of 
the  sight-word  strand,  the  student  enters  that  strand.  Similarly,  entry  into  the 
spelling  pattern  strand  and  the  phonics  strand  is  controlled  by  the  student’s 
placement  in  the  sight-word  strand.  On  any  given  day,  a student  may  be  seeing 
exercises  dra'vn  from  as  many  as  five  strands.  The  dotted  vertical  lines  in  Fig.  3 
represent  “miximal  rate  contours,”  which  control  the  student’s  progress  in  each 
strand  relative  to  progress  in  other  strands.  The  rationale  underlying  these 
contours  is  that  learning  particular  material  in  one  strand  facilitates  learning  in 
another  strand;  thus,  the  contours  are  constructed  so  that  the  student  learns 
specific  items  from  one  strand  in  conjunction  with  specific  items  from  other 
strands. 
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The  CAI  program  is  highly  individualized  so  that  a trace  through  the  curricu- 
lum is  unique  for  each  sfudent.  Our  problem  is  to  specify  how  a given  subject  s 
response  history  should  be  used  to  make  instructional  decisions.  The  approach 
ThTwe  have  adopted  is  to  develop  mathematical  models  for  the  acquisition  o 
the  various  skills  in  the  curriculum,  and  then  use  these  models  to  specify  optim 
Suencing  schemes.  Basically,  this  approach  is  what  has  come  to  be  known  m 
the  engineering  literature  as  “optimal  control  theory,  or,  more  simply,  control 
Seo^Tn  the  area  of  instruction,  the  system  to  be  controlled  is  the  human 
Lamer  raiiier  than  a machine  or  group  of  industries.  If  a learning modelcanbe 
specified,  then  methods  of  control  theory  can  be  used  to  derive  opti 

‘tltTflhTopTim^zation  piocedures  will  be  reviewed  later,  but  in  order  for 
a student  makes  an  error,  the  system  responds  with  an  audio  message  and  pr. 


TABLE  1 

Examples  c.«  Two  Exercises  Used  in  Strand  1 1 


Teletypewriter 

display 

Audio 

message 

Copy 

exercise 

The  program  outputs 

PEN 

(Type  pen.) 

The  student  responds  by  typing 

PEN 

(Great!) 

The  program  outputs 
The  program  outputs 

EGG 

(Type  egg.) 

The  student  responds  by  typing 
The  program  outputs 

EFF 

////EGG 

(No,  egg.) 

Recognition  exercise 


The  program  outputs 
The  student  responds  by  typing 
The  program  outputs 
The  program  outputs 
The  student  responds  by  typing 
The  program  outputs 


PEN  NET  EGG 

PEN 

+ 

PEN  EGG  NET 
NET 

+ 


(Type  pen.) 

(Type  net.) 
(Fabulous!) 


“The  top  panel  displays  the  copy  exercise  and  the  bottom  Ut 
recognition  exercise.  Rows  in  the  table  correspond  to  successive 
Lines  on  the  teletypewriter  printout. 
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out  the  correct  response.  In  earlier  versions  of  the  program,  the  student  was 
required  to  copy  the  correct  response  following  an  error.  Experiments  demon 
strated  that  the  overt  correction  procedure  was  not  particularly  effective;  simply 
displaying  the  correct  word  following  an  error  provided  more  useful  feedback. 

Strand  III  offers  practice  with  spelling  patterns  and  emphasizes  the  regular 
grapheme-phoneme  correspondences  that  exist  in  English.  Table  2 illustrates 
exercises  from  this  strand.  For  the  exercise  in  the  top  panel  of  Table  2,  the 
student  is  presented  with  three  words  involving  the  same  spelling  pattern  and  is 
required  to  select  the  correct  one  based  on  its  initial  letters.  Once  the  student 
has  learned  to  use  the  initial  letter  or  letter  sequence  to  distinguish  between 
words,  ht  moves  to  the  recall  exercise  illustrated  in  the  bottem  panel  of  Table  2. 
Here  the  ttudent  works  with  a group  of  words,  all  involving  the  same  spelling 
pattern.  Oil  each  trial  the  audio  system  requests  a word  that  requires  ridding  an 
initial  consonant  or  consonant  cluster  to  the  spelling  pattern  mastered  in  the 
preceding  exercise.  Whenever  a student  makes  a correct  response,  a “+”sign  is 
printed  on  the  teletypewriter.  In  addition,  every  so  often  the  program  will  give 
an  audio  feedback  message;  these  messages  vary  from  simple  ones  like  “great," 

that  s fabulous,”  “you’re  doing  brilliantly,”  to  some  that  have  cheering,  clap- 
ping, or  bells  ringing  in  the  background.  These  messages  are  not  generated  at 
random,  but  depend  on  the  student’s  performance  on  that  particular  day. 

When  the  student  has  mastered  a specified  number  of  words  in  the  sight-word 
strand,  he  or  she  begins  exercises  in  the  phonics  strand;  this  strand  concentrates 
on  initial  and  final  consonants  and  consonant  clusters  in  combination  with 
medial  vowels.  As  in  most  linguistically  orientated  curricula,  students  are  not 
required  to  rehearse  or  identify  consonant  sounds  in  isolation.  The  emphasis  is 
on  patterns  of  vowels  and  consonants  that  bear  regular  correspondences  to 


TABLE  2 

Examples  of  the  Recognition  and  Recall  Exercises  Used  in 
Strand  III  (Spelling  Patterns) 


Teletypewriter 

Audio 

display 

message 

Recognition  exercise 

The  program  outputs  KEPT  SLEPT  CREPT 

The  student  responds  by  typing  KEPT 

The  program  outputs  + 

(Type  kept.) 

Recall  exercise 

The  program  outputs 

The  student  responds  by  typing  CREPT 

(Type  crept.) 

The  program  outputs  + 

(That’s  fabulous!) 
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TABLE  3 

Example*  of  Two  Exerciie*  from  Strand  IV  (Phonics) 


Teletypewriter 

display 

Au<!io 

message 

Recognition  exerdse 

The  program  outputs 

The  student  responds  by  typing 

The  program  outputs 

The  program  outputs 

The  student  responds  by  typing 

The  program  outputs 

-IN-IT-IG 

IG 

+ 

—IT  -IN  -IG 
IT 

(Type  /IG/  as  in  fig.) 
(Good!) 

(Type  /IT/  as  in  fit) 

Build-a-word  exercise 

The  program  outputs 

-IN  -IT  — IG 

The  student  responds  by  typing 
The  program  outputs 
The  program  outputs 

P — 
PIN 
+ 

-IG  -IN  -IT 

(Type  pin.) 
(Great!) 

The  student  responds  by  typing 
The  program  outputs 

F — 

FIN 

////FIG 

(Type  fig.) 

(No,  we  wanted  fig.) 

phonemes.  The  phonic  strand  is  the  most  complicated  one  of  the  group  and 
involves  eight  exercise  formats;  two  of  the  formats  will  be  described  here.  The 
upper  panel  of  Table  3 illustrates  an  exercise  in  which  the  student  is  required  to 
identify  .the  graphic  representation  of  phonemes  occuring  at  the  end  of  words. 
Each  trial  begins  with  an  audio  presentation  of  a word  that  includes  the 
phonemes,  and  the  student  is  asked  to  identify  the  graphic  representation.  After 
mastering  this  exercise  the  student  is  transferred  to  the  exercise  illustrated  in  the 
bottom  panel  of  Table  3 The  same  phonemes  are  presented,  but  now  the 
student  is  required  to  construct  words  by  adding  appropriate  consonants. 


Optimal  Sequences  for  Individual  Students 

This  has  been  a briei  overview  of  some  of  the  exercises  used  in  the  curriculum;  a 
more  detailed  account  of  the  program  can  be  found  in  Atkinson,  Fletcher 
Lindsay,  Campbell,  and  Barr  (1973).  The  key  to  the  curriculum  is  the  optimiza- 
tion schemes  that  control  the  sequencing  of  the  exercises;  these  schemes  can  be 
classified  at  three  levels.  One  level  involves  decision  making  within  each  strand. 
The  problem  is  to  decide  which  items  to  present  for  study,  which  exercise 
ormats  to  present  them  in,  and  when  to  schedule  review.  A complete  response 
history  exists  for  each  student,  and  this  history  is  used  to  make  trial-by-trial 
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decisions  regarding  what  to  present  next.  The  second  level  of  optimization  deals 
with  decisions  about  allocation  of  instructional  time  among  strands  for  a given 
student.  At  the  end  of  an  instructional  session,  the  student  will  have  reached  a 
certain  point  in  each  strand  and  a decision  must  be  made  about  the  time  to  be 
allocated  to  each  strand  in  the  next  session.  The  third  Wei  of  optimization  deals 
with  the  distribution  of  instructional  time  among  students.  The  question  here  is 
to  allocate  computer  time  among  students  to  achieve  fessinWional  objectives 
that  are  defined  not  for  the  individual  student  but  for  the  class  as  a whole.  In 
some  global  sense,  these  three  levels  of  optimization  should  be  integrated  into  a 
unified  program.  However,  we  have  been  satisfied  to  work  with  each  separately, 
hoping  that  later  they  can  be  incorporated  into  a single  package. 

Optimization  within  a strand  (what  has  been  called  Level  1)  can  be  illustrated 
using  the  sight-word  strand.  The  strand  comprises  a list  of  about  1 ,000  words; 
the  words  are  ordered  in  terms  of  their  frequency  in  the  student’s  vocabulary, 
and  word?,  at  the  beginning  of  the  list  have  highly  regular  grapheme— phoneme 
correspondences.  At  any  point  in  time  a student  will  be  working  on  a limited 
pool  of  words  from  the  master  list;  the  size  of  this  working  pool  depends  on  the 
student’s  ability  level  and  is  usually  between  5 and  10  words.  When  one  of  these 
words  is  mastered,  it  is  deleted  from  the  pool  and  replaced  by  the  next  word  on 
the  list  or  by  a word  due  for  review.  Figure  4 presents  a flow  chart  for  the 
strand.  Each  word  in  the  working  pool  is  in  one  of  five  possible  instructional 
states.  A trial  involves  sampling  a word  from  the  working  pool  and  presenting  it 
in  an  appropriate  exercise  format.  The  student  is  pretested  on  a word  the  first 
few  times  it  is  presented  to  eliminate  words  already  known.  If  the  student  knows 
the  word  it  will  be  dropped  from  the  working  pool.  If  not,  the  student  first 
studies  the  word  using  the  recognition  exercise.  If  review  is  required,  the  student 
studies  the  word  again  in  what  is  designated  in  Fig.  4 as  Exercises  4 and  5. 

As  indicated  in  Fig.  4,  a given  word  passes  from  one  state  to  the  next  when  it 
reaches  criterion.  And  this  presents  the  crux  of  the  optimization  problem,  which 
is  to  define  an  appropriate  criterion  for  each  exercise.  This  has  been  done  using 
simple  mathematical  models  to  describe  the  acquisition  process  for  each  exercise 
and  the  transfer  functions  that  hold  between  exercises  (Atkinson  & Paulson, 
1972).  These  models  are  simple  Markov  processes  that  provide  reasonably 
accurate  accounts  of  performance  on  our  tasks.  Parameters  of  the  models  are 
defined  as  functions  of  two  factors:  (1)  the  ability  of  the  particular  student  and 
(2)  the  difficulty  of  the  particular  word.  An  estimate  of  the  student’s  ability  is 
obtained  by  analyzing  his  or  her  response  record  on  all  previous  words,  and  an 
estimate  of  a word’s  difficulty  i:  obtained  by  analyzing  performance  on  that 


FIG.  4 Partial  flow  chart  for  Strand  II  (sight-word  recognition).  The  various  decisions 
represented  in  the  bottom  part  of  the  chart  are  based  on  fairly  complicated  computations 
that  make  use  of  the  student’s  response  history.  The  same  recognition  exercise  is  used  in 
both  stateS,  andS,. 
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pMiculw  word  for  all  students  run  on  the  program.  The  student  record,  are 

Zr? updatcd  ty  thc  romt'uter  “d  “7— 

eStim*teil°/  *ach  stud=":’‘  *bility  factor  and  each  word’s  difficulty 
factor.  Given  a well-defined  model  and  estimates  of  its  parameters  we  can  ul 
the  methods  of  control  theory  to  define  an  optimal  criterion  for  each  exercise 

a?aitTleCveinari "tiT  °"  °f  the  itcm* the  indent’s 

h-th/Tli  d thePrecise  ^quence  of  correct  and  incorrect  responses  made 

scheme  not^  “ irnp0rtan,  *°  that  ^^timization 

bu  TJ  T P?”"  8 Pr0gr4m  based  0n  *•*  “Kent’s  last  response 

, l!rP  °n  ^ ^dent’s  complete  response  nisto^ 

ca^er  in  tfT  d^Tt"  '***  “Ucd  Uvcl  H>  wa*  mentioned 

«r her  in  the  description  of  maximum-rate  contours.  In  some  respects  this 

optumzation  program  is  the  most  interesting  of  the  group,  but  ri  c^not  t 

explained  without  going  into  considerable  mathematical  detail.  In  essence  a 

teaming  model  ,s  developed  that  specifies  the  learning  rate  on  each  strand  as  a 

strands" Ut  T""'  °f  matCrial  that  haS  been  mas,ercd  *"  each  of  the  other 

strategy  is  determined  based  on  the  model.  This  strategy  defines  a closed  lonn 

2“n  ^ ir,it,uctional  Moatiom  for  each  stranS 

based  on  die  best  current  estimate  of  how  much  the  student  has  mastered  in 

itvzi;::  “ ,-ta* f°-  - — ™ 


Optimizing  Class  Performance 


HI  The  UrrT  a"  ?ample  °f  °Ptim“«tion  at  what  has  been  called  l^vel 
L t effeCt,VeneK  of  the  CA:  program  can  be  increased  by  optimally 

Tr8  StUdenh'  SUPPOSC  that  8 ^b00'  bud- 
geted  a uxed  amount  of  time  forCAI  and  mus’  decide  how  to  allocate  that  time 

rv  »£!**•*  rr  F°r  *is  —SSS  ££ 

°f  Pn°'  s,udi"'  ,he  ”)“■>»"  to  ton  developed  ,o 

X',  SmSJ  “ -**  “ " 1 * ■ 


^ ; 0 = d4(0  - fl(t)  exp[-rCtOJ  • 

The  equation .predicts  Student  i’s  performance  on  a standardized  test  as  a 

S’ * 11,6  CAJ  *yStem  during  ***  «*««i  year-  The 

to  ™the?  rie  m a°  Charac,erize  Stude"U  and  vary  from  one  student 
to juiother  These  parameters  can  be  estimated  from  scores  on  reading  readiness 

and  from  the  student’s  performance  during  his  first  hour  of  CAI.  After 
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estimates  of  these  parameters  have  been  made,  the  above  equation  can  be  used 
to  predict  end  of-year  test  scores  as  a function  of  the  CA1  time  allocated  to  that 
student. 

Let  us  suppose  that  a school  has  budgeted  a fixed  amount  of  time  T on  the 
CA1  system  for  a first-grade  class  of  N students;  further,  suppose  that  students 
have  had  reading  readiness  tests  and  a preliminary  run  on  the  CAI  system  so  that 
estimates  of  the  parameters  A,  B,  and  C have  been  made  for  each  student.  The 
problem  then  is  to  allocate  time  T among  the  N students  so  as  to  optimize 
learning.  Jn  order  to  do  this,  it  is  first  necessary  to  have  a model  of  the  learning 
process.  Although  the  above  equation  does  not  offer  t very  detailed  account  of 
learning,  it  suffices  as  a model  for  purposes  of  this  problem.  This  is  an  impor.ant 
point  to  keep  in  mind;  the  nature  of  the  specific  optimization  problem  deter- 
mines the  level  of  complexity  that  needs  to  be  represented  in  the  leaning  model. 
For  some  optimization  problems,  the  nvjdel  must  provide  a relatively  detailed 
account  of  learning  to  specify  a viable  strategy,  but  for  other  problems  a simple 
descriptive  equation  may  suffice. 

In  addition  to  a model  of  the  learning  process,  we  mu>t  also  specify  ar 
instructional  objective.  Only  three  possible  objectives  will  be  considered  here: 

I.  Maximize  the  mean  value  of  P over  the  class  of  students. 

II.  Minimize  the  variance  of  P over  the  class  of  students. 

III.  Maximize  the  mean  value  of  P under  the  constraint  that  the  resulting 
variance  of  P is  less  than  or  equal  to  the  variance  that  would  be  obtained  if 
no  CAI  were  administered. 

Objective  I maximizes  the  gun  for  the  class  as  a whole;  Objective  II  reduces 
differences  among  students  by  making  the  class  as  homogeneous  as  possible;  and 
Objective  III  attempts  to  maximize  the  class  performance  while  insuring  that 
differences  among  students  are  not  amplified  by  CAI.  If  we  select  Objective  1 as 
the  instructional  objective,  then  the  problem  of  deriving  an  optimal  strategy 
reduces  to  maximizing  the  function: 

f[«\)A 2) /(AO)  = Yt { Mi)  - 0(i)expl-/(OO(O  ](. 

/(I)+/(2)+  •••  +*A0  =T. 

where  t(i)  is  the  time  allocated  to  Student  i.  This  maximization  can  be  done 
using  the  methods  of  dynamic  programming.  To  illustrate  the  approach,  compu- 
tations were  made  for  a first-grade  class  for  which  the  parameters  A,  B,  and  C 
had  been  estimated  for  each  student.  Employing  these  estimates,  computations 
were  carried  out  to  determine  the  time  allocations  that  maximized  the  above 
equation.  For  the  optimal  policy,  the  predicted  mean  performance  level  of  the 
class  on  the  end-of-year  tests  v/as  14%  higher  than  a policy  that  allocated  time 
equally  among  students  (i.e.,  an  equal-time  policy,  where  f (0  = T/N  for  all  0- 
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TABLE  4 

Predicted  Percent  Gain  in  the  Mean  of  P and 
in  the  Variance  of  P When  Compared  with  the 
Mean  and  Variance  of  the  Equal-rime  Policy 


Instructional  objective 

1 

I!  Ill 

Gain  in  mein  of  P(%) 

14 

~*5  8 

Gain  in  variance  of  • (%) 

15 

-12  -6 

This  gam  represents  a substantial  improvement;  the  drawback  is  that  the  class 

une  policy.  This  means  that  if  we  are  only  interested  in  raising  the  class  average 

andTet  thT  ‘°  ^ e Tf*  subs,an,ia,,y  more  on  the  CA1  system 

and  let  them  progress  far  beyond  the  slow  learners. 

Although  a time  allocation  that  complies  with  Objective  I does  increase  overall 

das,  performance,  other  objectives  need  to  be  considered  For  compel Time 

allocations  also  were  computed  for  Objectives  II  and  III.  Table  4 presents  the 

Z '?  . ?ain  Cla“  perfonn,nce  as  * P«ccntage  of  the  mean  value  for 

Lr  r r,,cyi  icc,iw  n yie,ded  a nega,ive  *a,u  * ,he  ^ it 

reducl  h i WaS  minimi2e  var*ability . which  is  accomplished  by 

Z \OCaUOnS  ,ean,CrS  Snd  giVing  m°re  a,,en,I0n  *° 

reduction  ,n  variability  for  Objective  II  is  12%.  Objective  III 
which  strikes  a balance  between  Objective  I and  Objective  II,  yields  an  8%  gam 
in  mean  performance  yet  reduces  variability  by  6%. 

and* lasmien V/ofT  ™lM’0,*C,ive  111  WOuld  * Prefcrred  bV  educators 
taininp  l u r Sub*'iI"al  increasc  in  av”age  performance  while  main- 

tonmg  a low  level  of  variability.  These  computations  make  it  clear  tint  the 

*^crion  of  an  instructional  objective  should  not  be  done  in  isolation  but  should 

than  one  “mpara,,ve  anaJys,s  of  *veral  objectives,  taking  into  account  more 

L class  v“,  Perf°rnUnCe  Even  if  ,he  PrinciPl  go-1  « to  maximize 
me  Class  average,  it  is  mappropnate  in  most  educational  situations  to  select 

Objective  , over  111  if  i,  is  only  slightly  better  for  the  class  average  wh"e 
permitting  variability  to  mushroom.3 

Effectiveness  of  the  Reading  Program 

Several  evaluation  studies  of  the  reading  program  have  been  conducted  in  the 

ri  zr  r** ,h"  "*•  ■ ^ ^ it 

some  detail  (Fletcher  A Atkinson,  1972).  In  this  particular  study,  50  pairs  of 
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kindergarten  students  were  matched  on  a number  of  variables,  including  sex  and 
readiness  scores.  At  the  start  of  the  first  grade,  one  member  of  each  pair  was 
assigned  to  the  experimental  group  and  the  other  to  the  control  group.  Students 
in  the  experimental  group  received  CAI,  but  only  during  the  first  grade;  students 
in  the  control  group  received  no  CAI . The  CAI  lasted  approximately  1 5 min  per 
day4;  during  this  period  the  control  group  studied  reading  in  the  classroom. 
Except  for  this  15  min  period,  the  school  day  for  the  CAi  group  was  like  that  of 
the  control  group.  Standardized  tests  were  administered  at  the  end  of  the  first 
grade  and  again  at  the  end  of  the  second  grade.  All  the  tests  showed  roughly  the 
same  pattern  of  results;  to  summarize  the  findings,  only  data  from  the  California 
Cooperative  Primary  Reading  Test  will  l>e  described.  At  the  end  of  the  first 
grade,  the  experimental  group  showed  a 5.05-month  gain  over  the  control  group. 
The  groups,  when  tested  2 year  later  (wtih  no  intervening  CAI  treatment), 
showed  a difference  of  4.90  months.  Thus,  the  initial  difference  observed 
following  one  year  of  CAI  was  maintained,  although  not  amplified,  during  the 
second  year  when  uo  CAI  was  administered  to  either  group. 

No  definitive  conclusions  can  be  drawn  from  evaluation  studies  of  this  sort 
about  the  specific  contributions  of  CAI  versus  other  aspects  of  the  situation. 
Obviously  the  curriculum  materials  used  in  the  CAI  program  are  important,  as 
well  as  other  factors.  To  do  the  type  of  study  that  would  isolate  the  important 
variables  is  too  large  an  undertaking  to  be  worthwhile  at  this  juncture  in  the 
development  of  the  reading  program.  Thus,  to  some  extent  it  is  a matter  of 
judgment  in  deciding  which  variables  account  for  the  differences  observed  in  the 
above  study.  In  my  view,  individualizing  in> ■ ruction  is  the  key  factor  in  success- 
fully teaching  reading.  This  does  not  mean  that  all  phases  of  instruction  should 
be  individualized,  but  certain  skills  can  be  mastered  only  if  instruction  is 
sensitive  to  the  student’s  particular  difficulties.  A reading  teacher  interacting  on 
a one-to-one  basis  with  a student  may  be  more  effective  than  our  CAI  program. 
However,  when  working  with  a group  of  children  (even  as  few  as  four  or  five),  it 
is  unlikely  that  the  teacher  can  match  the  computer's  effectiveness  in  making 
instructional  decisions  over  an  extended  period  of  time 


SECOND-LANGUAGE  VOCABULARY  LEARNING 

In  this  section,  research  on  CA!  programs  for  second-language  vocabulary  learn- 
ing will  be  discussed.  As  noted  elsewhere  in  this  chapter,  the  principal  goal  of 
our  research  on  computerized  instruction  has  been  to  deveiop  adaptive  teaching 
proctdures-procedures  that  make  moment-by-moment  decisions  about  which 
instructional  action  should  be  taken  next  based  on  the  student’s  unique  response 
history.  To  help  guide  the  theoretical  aspects  of  this  work,  some  years  ago  we 

*ln  this  study  no  attempt  was  made  lo  allocate  time  optimally  among  students  in  the 
experimental  group;  rather,  an  equal-time  policy  was  employed. 
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initiated  a reries  of  experiments  on  the  very  restricted  but  well-defined  problem 
of  optimizing  the  teaching  of  a foreign-language  vocabulary.  This  is  an  area 
where  mathematical  models  provide  an  accurate  description  of  learning,  and 
these  models  tan  be  used  in  conjunction  with  the  methods  of  control  theory  to 
derive  precise  algorithms  for  sequencing  instruction  among  vocabulary  items. 
Although  our  original  interest  in  this  topic  was  primarily  theoretical,  the  work 
has  proved  to  have  significant  practical  applications.  These  applications  involve 
computerized  vocabulary  learning  programs  designed  to  supplement  college-level 
courses  in  second-language  instruction.  A particularly  interesting  effort  involves 
a supplementary  Russian  program  in  use  at  Stanford  University.  Students  are 
exposed  to  approximately  1,000  words  per  academic  quarter  using  the  com- 
puter; in  conjunction  with  normal  classroom  work  this  program  enables  them  to 
develop  a substantial  vocabulary.5  Many  foreign  language  instructors  believe  that 
the  major  obstacle  to  successful  insti  action  in  a second  language  is  not  learning 
the  grammar  of  the  language,  but  rather  in  acquiring  a sufficient  vocabulary  so 

that  the  student  can  engage  in  meaningful  conversations  and  read  materials  other 
than  the  textbook. 

In  examining  the  work  on  vocabulary  acquisition  I will  not  describe  the  CAI 
programs,  but  will  review  some  research  on  optimal  sequencing  schemes  that 
provide  the  taeoret.cal  rationale  for  the  programs.  It  will  be  useful  to  describe 
one  experiment  in  some  detail  before  considering  more  general  issues. 


An  Experiment  on  Optimal  Sequencing  Schemes 


In  this  study  a large  set  of  German-English  items  are  to  be  learned  during  an 
instructional  session  that  involves  a series  of  trials.  On  each  trial,  one  of  the 
German  words  is  presented  and  the  student  attempts  to  give  the  English 
translation;  the  correct  translation  is  then  presented  for  a brief  study  period  A 
predetermined  number  of  trials  is  allocated  for  the  instructional  session,  and 
alter  some  intervening  period  a test  is  administered  over  the  entire  vocabulary. 
The  problem  is  to  specify  a strategy  for  presenting  items  during  the  instructional 
session  so  that  performance  on  the  delayed  test  will  be  maximized. 

Four  strategies  for  sequencing  the  instructional  material  will  be  considered. 
Tue  random-order  strategy,  (RO),  is  to  cycle  through  the  set  of  items  randomly 
this  strategy  is  not  expected  to  be  particularly  effective,  but  it  provides  a’ 
benchmark  against  which  to  evaluate  other  procedures.  The  self-selection 
strategy  (SS),  is  to  let  the  student  determine  how  best  to  sequence  the  material. 
In  this  mode,  the  student  decides  on  each  trial  which  item  is  to  be  presented;  the 
learner  rather  than  an  external  controller  determines  the  sequence  of  instruction. 


These  CAI  vocabulary  programs  make  use  of  optima!  sequencing  schemes  of  the  sort  to 
be  discussed I in  this  section,  as  well  as  certain  mnemonic  aids.  For  a discussion  of  these 
mnemonic  aids  see  Raugh  and  Atkinson  (1975)  and  Atkinson  and  Raugh  (1975). 
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The  third  and  fourth  scheu.es  are  based  on  a decision-theoretic  analysis  of  the 
task.  A mathematical  model  that  provides  an  accurate  account  of  vocabulary 
acquisition  is  assumed  to  hold  in  the  present  situation.  The  model  is  used  to 
compute,  on  a trial-by-trial  basis,  an  ! .ividual  student’s  current  state  of  learn- 
ing. Based  on  these  computations,  items  are  selected  for  test  and  study  so  as  to 
optimize  the  level  of  learning  achieved  at  the  termination  of  the  instructional 
session.  Two  optimization  strategies  derived  from  this  type  of  analysis  will  be 
examined.  In  one  case,  the  computations  for  determining  an  optimal  strategy  are 
carried  out  assuming  that  all  vocabulary  items  are  of  equal  difficulty;  this 
strategy  is  designated  OE  (i.e.,  optimal  under  the  assumption  of  equal  item 
difficulty).  In  the  other  case,  the  computations  take  into  account  variations  in 
difficulty  level  among  items;  this  strategy  is  called  OU  (i.e.,  optimal  under  the 
assumption  of  unequal  item  difficulty).  The  details  of  these  two  strategies  will 
be  described  later. 

The  experiment  was  carried  out  under  computer  control;  the  details  of  the 
experimental  procedure  are  given  in  Atkinson  (1972b).  The  students  partici- 
pated in  two  sessions:  an  “instructional  session”  of  approximately  two  hours 
and  a briefer  “delayed-test  session”  administered  one  week  later.  The  delayed 
test  was  the  same  for  all  students  and  involved  a test  over  the  entire  vocabulary. 
The  instructional  session  was  more  complicated.  The  vocabulary  items  were 
divided  into  seven  lists,  each  containing  12  German  words;  the  seven  lists  were 
arranged  in  a round-robin  order.  On  each  trial  of  the  instructional  session  a list 
was  displayed  on  a projection  screen,  and  the  student  inspected  it  for  a brief 
period  of  time;  the  list  involved  only  the  12  German  words  ard  not  their  English 
translations.  Then  one  of  th“  items  on  the  list  was  selected  for  test  and  study.  In 
the  RO,  OE,  and  OU  conditions  the  item  was  selected  by  the  computer;  in  the 
SS  condition  the  item  was  chosen  by  the  student.  After  an  item  was  selected  for 
test,  the  student  attempted  to  provide  a translation  by  typing  it  on  the  computer 
console;  then  feedback  regarding  the  correct  translation  was  given.  The  next  trial 
began  with  the  computer  displaying  the  next  list  in  the  round  robin,  and  the 
same  procedure  was  repeated.  The  instructional  session  continued  in  ibis  fashion 
for  336  trials. 

The  results  of  the  experiment  are  summarized  in  Fig.  5.  Data  are  presented  on 
the  left  side  of  the  figure  for  performance  on  successive  blocks  of  trials  during 
the  instructional  session;  on  the  right  are  results  from  the  test  session  adminis- 
tered one  week  after  the  instructional  session.  The  data  from  the  instructional 
session  are  presented  in  successive  blocks  of  84  trials;  for  the  RO  condition  this 
means  that  on  the  average  each  item  was  presented  once  in  each  o‘‘  these  blocks. 
Note  that  performance  during  the  instructional  session  is  besi  for  the  RO 
condition,  next  best  for  the  OE  condition  which  is  slightly  better  Jian  the  SS 
condition,  and  poorest  for  the  OU  condition.  The  order  of  the  groups  is  reversed 
on  the  delayed  test.  (Two  points  are  displayed  in  the  figure  for  the  delayed  test 
to  indicate  that  the  test  involved  two  random  cycles  through  the  entire  vocabu- 
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FIG.  5 Proportion  of  correct  responses  in  successive  trial  blocks  during  the  instructional 
session  and  or.  the  delayed  test  administered  one  week  later. 


lary;  however  the  values  given  are  the  average  over  the  two  test  cycles.)  The  OU 
condition  is  best  .”ith  a correct  response  probability  of  .79;  the  SS  condition  is 
next  with  .58;  the  0."  condition  follows  closely  at  .54  and  the  RO  condition  is 
poorest  at  .38.  The  observed  pattern  of  results  is  what  one  would  expect.  In  the 
SS  condition,  the  student  are  trying  to  test  themselves  on  items  they  do  not 
know;  consequently,  during  *he  instructional  session,  they  should  have  a lower 
proportion  of  correct  response.'  than  students  run  on  the  RO  procedure  where 
items  are  tested  at  random.  Similarly,  the  OE  and  OU  conditions  involve  a 
procedure  that  attempts  to  identify  and  test  those  items  that  have  not  yet  been 
mastered  and  should  produce  high  error  rates  during  the  instructional  session. 
The  ordering  of  groups  on  the  delayed  test  is  reversed  since  all  words  are  tested 
in  a nonselective  fashion;  under  these  conditions  the  proportion  of  correct 
responses  provides  a measure  of  a student’s  true  mastery  of  the  total  set  of 
vocabulary  items. 

The  magnitude  of  the  effects  observed  on  the  delayed  test  are  of  practical 
significance.  The  SS  condition  (when  compared  to  the  RO  condition)  leads  to  a 
relative  gain  of  53%,  whereas  the  OU  condition  yields  a relative  gain  of  108%.  It 
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the  learning  of  Item  i in  the  vocabulary  set.  The  first  three  parameters  govern  the 
acquisition  process;  the  next  parameter,  forgetting;  and  the  last,  the  student’s 
knowledge  prior  to  entering  the  experiment. 

We  now  turn  to  a discussion  of  how  the  OE  and  OU  procedures  were  derived 
from  the  model.  Prior  to  conducting  the  experiment  reported  here,  a pilot  study 
was  run  using  the  same  word  lists  and  the  RO  procedure  described  above.  Data 
from  the  pilot  study  were  employed  to  estimate  the  parameters  of  the  model; 
the  estimates  were  obtained  using  the  minimum  chi-square  procedures  described 
in  Atkinson  (1972b).  Two  separate  estimates  of  parameters  were  made.  In  one 
c?se  it  was  assumed  that  the  items  were  all  equally  difficult,  and  data  from  all  84 
items  were  lumped  together  to  obtain  a single  estimate  of  the  arameter  vector; 
this  estimation  procedure  will  be  called  the  equal-parameter  (E  case).  In  the 
second  case  the  data  were  separated  by  items,  and  an  estimate  of  the  parameter 
vector  was  made  for  each  of  the  84  items;  this  procedure  wil1  be  called  the 
unequal-parameter  case  (U  case).  The  two  sets  of  parameter  estimates  were  then 
used  to  generate  the  optimization  schemes  previously  referred  to  as  the  OE  and 
OU  procedures. 

In  order  to  formulate  an  instructional  strategy,  it  is  necessary  to  be  precise 
about  the  quantity  to  be  maximized.  For  the  present  experiment  the  goal  is  to 
maximize  the  total  number  of  items  the  student  correctly  translates  on  the 
delayed  test.6  To  do  this,  we  need  to  specify  the  relationship  between  the  state 
of  learning  at  the  end  of  the  instructional  session  and  performance  on  the 
delayed  test.  The  assumption  made  here  is  that  only  those  items  in  State  P at  the 
end  of  the  instructional  session  will  be  translated  correctly  on  the  delayed  test; 
an  item  in  State  T is  presumed  to  be  forgotten  during  the  intervening  week. 
Thus,  the  problem  of  maximizing  delayed-test  performance  involves  maximizing 
the  number  of  items  in  State  P at  the  end  of  the  instructional  session. 

Having  numerical  values  for  parameters  and  knowing  a student’s  response 
history  it  is  possible  to  estimate  the  student’s  current  state  of  learning.7  Stated 

‘Other  measures  can  be  used  to  assess  the  benefits  of  an  instructional  strategy;  for 
example.  in  this  case  weights  could  be  assigned  to  items  measuring  their  relative  importance. 
Also  cos  s may  be  associated  with  the  various  actions  taken  during  an  instructional  session. 
Thus,  for  the  general  case,  the  optimization  problem  involves  assessing  costs  and  benefits 
and  finding  s strategy  that  maximizes  an  appropriate  function  defined  on  them,  i . a 
discussion  of  these  points  see  Dear,  Silberman,  Estavan,  and  Atkinson  (1967),  and  Smal, 
wood  (1962,  1971). 

’The  student’s  “response  history”  is  a record  for  each  trial  of  the  vocabulary  item 
presented  and  the  response  that  occurred.  It  can  be  shown  that  there  exists  a "sufficient 
history”  that  contains  only  the  information  necessary  to  estimate  the  student’s  current  state 
of  learning;  the  sufficient  history  is  a function  of  the  complete  history  and  the  assumed 
learning  model  (Groen  & Atkinson,  1966).  For  the  model  considered  in  this  paper  the 
sufficient  history  is  fairly  simple.  It  is  specified  in  terms  of  individual  vocabulary  items  for 
each  student;  we  need  to  know  the  ordered  sequence  of  correct  and  incorrect  responses  to  a 
given  item  plus  the  number  of  errors  (to  other  items)  that  intervene  between  each 
presentation  of  the  item. 
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more  precisely,  the  learning  model  can  be  used  to  derive  equations  and,  in  turn 
compute  the  probabilities  of  being  in  States  P,  T,  and  U for  each  item  at  the 
start  of  any  trial,  conditionalized  on  the  student’s  response  history  up  to  that 
trial.  Given  numerical  estimates  of  these  probabilities,  a strategy  for  optimizing 
performance  is  to  select  that  item  for  presentation  that  has  the  greatest  proba- 
bility of  moving  into  State  P.  This  strategy  has  been  termed  the  one-stage 
optimization  procedure  because  it  looks  ahead  one  trial  in  making  decisions.  The 
true  optimal  policy  (i.e.,  an  N-t tage  procedure)  would  consider  all  possible 
item-response  sequences  for  the  remaining  trials  and  select  the  next  item  so  as  to 
maximize  the  number  of  items  in  State  P at  the  termination  of  the  instruc- 
tional session.  Unfortunately,  for  the  present  case  the  iV-stage  policy  cannot  be 
applied  because  the  computations  are  too  time  consuming  even  for  a large 
computer.  Monte  Carlo  studies  indicate  that  the  one-stage  policy  is  a good 
approximation  to  the  optimal  strategy;  it  was  for  this  reason,  as  well  as  the 
relative  ease  of  computing,  that  the  one-stage  procedure  was  employed.  For  a 
discussion  of  one-stage  and  A'-stage  policies  and  Monte  Carlo  studies  comparing 
them  see  Groen  and  Atkinson  (1966),  Calfee  (1970),  and  Laubsch  (1970). 

The  optimization  procedure  described  above  was  implemented  on  the  com- 
puter and  permitted  decisions  to  be  made  for  each  student  on  a trial-by-trial 
basis.  For  students  in  the  OE  group,  the  computations  were  carried  out  using  the 
five  parameter  values  estimated  under  the  assumption  of  homogeneous  items 
- case);  for  s^nts  in  the  OU  group  the  computations  were  based  on  the  420 
parameter  values  estimated  under  the  assumption  of  heterogeneous  items  (U 
case). 

The  0U  Procedure  is  sensitive  to  interitem  differences  and  consequently 
generates  a more  effective  optimization  strategy  than  the  OE  procedure.  The  OE 
procedure,  however,  is  almost  as  effective  as  having  the  student  make  his  own 
instructional  decisions  and  far  superior  to  a random  presentation  scheme. 

The  study  reported  here  is  one  in  a series  of  experiments  dealing  with  optimal 
sequencing  schemes.  It  was  selected  because  it  is  easily  described  and  permits 
direct  comparison  between  a learner  -controlled  procedure  versus  procedures 
based  on  a decision-theoretic  analysis.  For  a review  of  other  studies  similar  to 
the  one  reported  above  see  Chiang  (1974),  Delanuy  (1973),  Laubsch  (1970), 
Kimball  (1973),  Paulson  (1973),  and  Atkinson  an.d  Paulson  (1972).  Some  of 
these  studies  examine  procedures  that  are  more  powerful  than  the  ones  de- 
scribed here,  but  they  are  complicated  and  difficult  to  describe  without  going 
into  mathematical  detail.  The  major  improvements  involve  two  facers:  (1) 
methods  for  estimating  the  model’s  parameters  during  the  course  of  insu  tcion, 
and  (2)  more  sophisticated  ways  of  interpreting  the  parameters  of  the  model  to 
take  account  of  both  differences  among  students  and  differences  among  items. 
For  example,  let  P(i,  f)  be  a generic  symbol  for  a parameter  vector  characterizing 
student  / learning  vocabulary  item  /.  In  these  studies  P(i,  j)  U specified  as  a 
function  of  a vector  A(i)  measuring  the  ability  of  student  i and  a vector  DQ) 
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Lorton,  Searle,  and  Atkinson  (1973)  indicate  that  the  learner  is  not  a particu- 
larly effective  decision  maker  in  guiding  the  learning  process. 

Flsewhere  1 have  defined  the  criteria  that  must  be  satisfied  before  an  optimal 
instructional  procedure  can  be  derived  using  formal  methods  (Atkinson,  1972a). 
Roughly  stated,  they  require  that  the  following  elements  of  an  instructional 
situation  be  clearly  specified: 

1 . The  set  of  admissible  instructional  actions. 

2.  The  instructional  objectives. 

3.  A measurement  scale  that  permits  costs  to  be  assigned  to  each  of  the 
instructional  actions  and  payoffs  to  the  achievement  of  instructional  objec- 
tives. 

4.  A model  of  the  learning  proa  ss. 

If  these  four  elements  can  be  given  a precise  interpretation,  then  it  is  usually 
possible  to  derive  an  optimal-instructional  policy.  The  solution  for  an  optimal 
policy  is  not  guaranteed,  but  in  recent  years  powerful  tools  have  been  developed 
for  discovering  optimal,  or  near  optimal,  procedures  if  they  exist.  I will  not 
discuss  these  four  element'  here  except  to  note  that  the  first  three  can  usually  be 
specified  with  a fair  degree  of  consensus.  Issues  of  short-term  versus  long-ten.i 
assessments  of  costs  and  payoffs  raise  important  questions  regarding  educational 
policy,  but  at  least  for  the  types  of  instructional  situations  examined  here 
reasonable  specifications  can  be  offered  for  the  first  three  elements.  However, 
the  fourth  element-the  specification  of  a model  of  the  learning  process- 
represents  a major  obstacle.  Our  theoretical  understanding  of  learning  is  so 
limited  that  only  in  very  special  cases  '■-ii  a model  be  specified  in  enough  detail 
to  enable  the  derivation  of  optimal  procedures.  Until  we  have  a much  deeper 
understanding  of  the  learning  process,  the  identification  of  truly  effective 
strategies  will  not  be  possible.  However,  an  all-inclusive  theory  of  learning  is  not 
a prerequisite  for  the  development  of  optimal  procedures.  What  is  needed  is  a 
model  that  captures  the  essential  features  of  that  part  of  the  learning  process 
being  tapped  by  a given  instructional  task.  Even  models  that  have  been  rejected 
on  the  basis  of  laboratory  investigations  may  be  useful  in  deriving  instructional 
strategies.  Several  of  the  learning  models  considered  in  this  chapter  have  proven 
unsatisfactory  when  tested  in  the  laboratory  and  evaluated  using  standard 
goodness-of-fit  criteria;  nevertheless,  the  optimal  strategies  they  generate  are 
often  quite  effective.  My  own  preference  is  to  formulate  as  complete  a learning 
model  as  intuition  and  data  will  permit  and  then  use  that  model  to  investigate 
optimal  procedures.  When  possible  the  learning  model  should  be  represented  in 
the  form  of  mathematical  equations,  but  otherwise  as  a set  of  statements  in  a 
computer-simulation  program.  The  main  point  is  that  the  development  of  a 
theory  of  instruction  cannot  progress  if  one  holds  the  view  that  a comprehensive 
theory  of  learning  is  a prerequisite.  Rather,  advances  in  learning  theory  will 
affect  the  development  of  a theory  of  instruction,  and  conversely  the  develop- 
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mcnt  of  a theory  of  instruction  will  influence  the  direction  cf  research  on 
learning. 
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A theory  of  instruction  must  be  based  on  the  objectives  of  the  learner  and 1 the 
institutionalized  goals  of  the  instructional  system.  A theory  of  mstruc  ion  m 
S re  t upon  an  adequate  formulation  of  the  psychology  capacity  of  the 
teame^lnstnrctional  goals  Prescribe  the  domain  of 

the  learner-  the  psychological  capacities  of  the  learner  set  limits  on  the  size, 
complexity  of  the^ubtasks  and  the  rate  of  introducing  them  into  the  domain. 

Each  of  the  first  four  chapters  presents  a framework  for  a"a'yzing.tasks 
various  psychological  perspectives.  Carroll  contrasts  strict  behaviorism  with 
currently  nonexistent  cognitive  theory  of  learning.  Calfee  takes  a ^t.stical  vie  . 
Resnick  eclectically  draws  from  Gagnes  hierarchies,  Gestalt  Psychology.  Thor  • 
dike  and  Piaget  to  formulate  a rational  information  processing  scheme.  Atkin 
saS  approach  uses  optimization  procedures  to  guide  instruct.onal  design. 

In  general  instructional  design  attempts  to  organize  subtask  sequences, 
provide  opportunities  for  learning,  and  devise  ways  of  evaiuatmg  the  extent^ 
which  the  learner  acquires  proficiency.  Notice  that  these  genera 
neutral  with  respect  to  methods  ^ 

tt>rK,ha^r^rP."tj  i;T„r=.o,s  „ 

followed  by  Pitigrat ion.  If  Carroll  is  light  in  his  assessment  of  the  role  behavioral 
* ,L  in  education,  behavioral  objectives  will  be  ignored  and  oon 

“„o«en  TOe,  will  become  jus.  on.  more  fad  thrust  on  the  field.  Behavrora 
objectives  may  give  way  to  cognitive  objectives  as  defined  by  Greeno  in  Chapter 
?T3us  right,  the  naive  theories  of  learning  on  which  many  an  artfu 
teachM^edicateslesson  plans  wdl  be  explained  by  cognitive  theory,  bur  no, 
necessarily  extended  by  it. 


I . ? 
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Carroll,  of  course,  is  right— at  least  in  part.  Cognitive  theories  and  information 
processing  models  will  supplant  some  of  the  currently  fashionable  notions  - they 
have  already  done  so.  Furthermore,  there  is  evidence  from  the  set  of  current 
chapters  of  a cumulative  body  of  knowledge  dealing  with  important  issues  of 
task  analysis  and  instruction  that  a lasting  contribution  will  result. 

What  are  the  steps  in  a task  analysis?  First,  we  set  out  to  identify  component 
skills  such  that  mastery  of  the  individual  skill  components  assures  partial  success 
on  the  task.  We  assume  that  the  hierarchy  of  skills  exists  and  that  components 
are  independent.  Calfee  attacks  the  issue  of  independence  directly.  To  the  extent 
that  subprocesses  like  decoding  and  comprehension  can  be  treated  indepen- 
dently, the  analysis  of  components  will  be  successful.  One  direction  that  re- 
search must  take  is  the  identification  of  the  conditions  under  which  tasks  may 
conform  to  a simple  additive  hierarchical  decomposition.  Next  ui  the  sequence 
of  analysis,  we  introduce  components  one  at  a time.  As  Resnick  points  out  in 
Chapter  3 these  components  must  be  teachable.  The  usual  next  step  in  the  task 
analysis  requires  that  we  start  where  the  learner  is  and  build  on  what  he  already 
knows  and  can  do.  Here  the  issues  revolve  around  the  diagnostic  tests  for 
determining  the  initia'  state  of  the  learner,  and  the  development  of  instructional 
materials  that  exercise  the  component.  Instructional  materials,  the  teaching 
routines  that  Resnick  refers  to,  must  provide  a basis  for  integrating  the  new 
learning  with  the  old.  Sut  tLs  third  stage  of  task  analysis  attempts  to  integrate 
the  newly  acquired  skills  intc  ~ ■•eaningful  whole.  Of  special  concern  are  the  real 
time  constraints.  Very  often,  subtasks  that  appear  to  be  well  learned  fall  apart 
when  combined  in  new  ways.  Think  of  the  feeling  of  helplessness  when  you  last 
failed  to  recall  the  name  of  a close  friend  in  performing  introductions  around  a 
group.  Paced  recall  is  not  a cognitive  task  that  we  are  asked  to  perform 
frequently. 

Are  there  cognitive  theories  of  learning  that  can  be  applied  to  the  job  of  task 
analysis?  Carroll’s  review  of  naive,  behav'oristic,  and  cognitive  theories  of  lan- 
guage learning  suggests  that  a cognitive  learning  theory  now  exists.  Unfortu- 
nately, recent  work  on  semantic  memory  (Norman,  Rumelhart,  et  al.,  1975; 
Quillian,  1967,  1968;  Schank,  1972)  has  been  primarily  concerned  with  deter- 
mining the  structure  of  memory  representations.  Research  on  the  processes  of 
understanding  (Hayes  & Simon,  1974  and  this.volume;  Winograd,  1972)  assume 
that  a knowledge  base  is  already  learned  and  available.  Of  course,  Greeno  has 
asserted  that  learning  theory  and  comprehension  theory  defined  in  information 
processing  terms  are  the  same  thing  (Greeno,  1974).  My  own  work  on  sequential 
pattern  learning  (Gregg,  1 967),  verbal  learning  (Gregg  & Simon,  1967),  and  serial 
learning  (Gregg,  1972;  McLean  & Gregg,  1967)  leads  me  to  argue  that  they  are 
not  the  same.  One  difference  is  in  the  development  of  intermediate  structures  in 
working  memory  that  Greeno  has  previously  described.  However,  it  seems 
appropriate  to  stress  the  similarities  between  learning  and  the  processes  of 
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understanding.  Both  depend  on  attention  to  determine  the  final  outcomes  Both 

°f  ihC  e!CmcntS  to  be  ^ated.  The  study  of  comprehension 
and  learn,  nj  is  the  study  of  cognitive  organizations. 

Although  there  is  no  explicit  and  general  cognitive  theory  of  learning,  there 
have  been  isolated  examples  pointing  potential  future  directions.  The  Elemen- 
tary Perce, ver  and  Memorizer,  EPAM,  was  the  first  modern  theory  of  semantic 
memory  and  showed  the  main  outlines  of  associative  memory  structures.  Atkin- 
son  and  Shiffnn  s (1968)  model  for  running  paired  associates  learning  accen- 
tuated the  importance  of  control  processes  in  learning  and  memory.  Just  as  the 
role  of  semantic  memory  emphasizes  structural  aspects  of  representations  of 
knowledge,  snort-term  memory  si  .'dies  emphasize  the  dynamic  operation  of 
eon  ro  processes  if.  specific  tasks.  Learning  and  comprehension,  therefore,  are 
similar  precisely  because  they  come  about  from  the  operation  of  the  human 
information  processing  system.  His  processing  limitations  determine  the  rate  of 
acquisition  of  new  information  and  its  availability  for  use  in  problem  solving. 


INL  EPENDENCE  OF  COGNITIVE  SKILLS 

Are  the  processes  that  the  human  information  processing  system  uses  detectable 
from  an  analysis  of  overt  behavior  in  instructional  task  environments?  This  is  the 
issue  that  Calfee  examines  in  Chapter  2. 

The  method  that  Calfee  proposes  is  aimed  at  finding  out  whether  or  not 
processes  are  independent.  Finding  they  are  not  is  not  very  informative.  Since 
the  method  does  not  generate  process  descriptions,  only  tests  the  effects  of 
them,  one  must  create  variables  to  test  the  hypothesis  that  a process  whose 
operation  may  or  may  not  be  correctly  reflected  by  the  measure,  exists.  There 
are  two  stages  at  cither  of  which  an  error  can  occur.  Each  stage  provides  an 
opportunity  for  error.  One  can  incorrectly  assume  that  a process  exists  that 
ransforms  a data  structure  in  a certain  way.  But  one  can  also  incorrectly  assess 

variate  ^ ^ hyp0theS,Zed  process  by  faUin8  t0  define  a relevant  dependent 

Most  of  the  theorizing  must  go  on  in  advance  of  and  independently  of  the 
testing  operations  suggested  by  Calfee.  Modern  inference  procedures  make  it 
possible  to  carry  out  much  more  powerful  analyses  of  system  interactions. 

Even  though  the  model  of  reading  proposed  by  Calfee  appears  to  be  correct 
the  processes  must  not  remain  independent  for  long.  The  still  higher  order 
cogni  ive  processes  in  speed  reading  and  comprehension  cannot  depend  on 
simple  additive  components  because  selective,  intentional  use  of  strategies  im- 
plies the  constructive  use  of  components. 

We  must  distinguish  between  pedagogically  useful  packets  of  information  and 
the  information  processing  mechanisms  that  may  or  may  not  view  the  materials 
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mechininm  re«Uy  in^ntot?^^nw^  ^ *'  hUm'n 

which  we  show  *quent„l  order  in  the  proceuing  ofTin^LT^  ‘° 


INTERACTIONS  OF  SEQUENTIAL  PROCESSES 

In  «ny  informalion  proceoing  ryrirm,  there  »ai  be  a Uck  of  .uUaicJ 
.moo.  the  mbprocewe,  if  ^ output  of  0M 
later  process  in  the  sequence.  Since  we  believe  th.t «««♦  P10™”  mputs  to  a 

attention  are  seri.l  , , , th,t  most  coF»Hve  task  involving 

p^«.  v “k*  la'  WtUCh  * in'hpm.lennn  of  the 

procesies  is  likely.  The  question  is  not  whether  they  are  independent  for  .h™ 

How*?*0*1  “f^inly  "0t  The  crucial  question  is.  “How  are  they  organized?” 
How  do  we  study  interacting  processes?  What  ferns  of  organization  ex*? 

Ote  of  our  most  popular  notions  of  task  option  JL  tree 3Ure  ^ 

ggest  that  the  prior  elements  are  not  stable  and  must  change  Even  when  thf 
learner  brings  well  developed  skills  to  the  task  there  * ’ t ™ °*e 

Which  the  prior  e.Ln,  .*£ 

wrong0"  afe  ,ncorPorated  *nt°  higher  organizations  whole  cloth  is  probably 

xf sz  rr  m 

'cfldl  lo  Ilur  one  P‘,lem  mlo  another.  New  pattern  result 

* ■ <"■'  of  different,  between  eompetency 
in  early  Hater  of  teaming  and  in  later  rtager.  One  reaton  I believe  „ m„  ,Z 

take  on  th  ^ ,n,truc,|onal  during  execution.  Thus,  mental  skills 

e properties  of  automatic  motor  performance.  For  examnle  inni 

^score  can  be  qu^Le'r^^ 
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In  summary,  we  suppose  that  location  of  the  tests  may  change,  the  size  of  the 
perceptual  or  conceptual  unit  may  vary,  and  that  cues  for  initiating  or  sustaining 
the  behavior  may  be  different  at  different  stages  of  practice. 


INDIVIDUALIZATION  AND  TASK  ANALYSIS 

Atkinson  s approach  stresses  the  idea  that  the  learner  must  be  able  to  follow 
flexible  pathways  through  a subject  domaine.  Optimization  of  CAI  ^an.ing  tasks 
is  based  on  a curriculum  composed  of  a network  of  related  tasks  and  a loose 
collection  of  associated  skills.  In  Atkinson’s  view,  subtasks  and  skills  need  not  be 
related  in  a rigidly  hierarchical  fashion.  This  :n  not  to  say  that  the  instructional 
materials  for  CAI  are  not  carifully  thought  through.  Rather,  the  point  is  that 
individual  learners  can  achieve  higher  performance  levels  in  a variety  of  ways. 
For  the  specific  areas  of  application,  there  is  a “careful  analysis  and  precise 
specification  of  the  skills  inherent  in  the  subject  matter.”  Thus,  for  Atkinson  the 
subject  mutter  defines  the  structure  of  tasks;  the  leaj.ei  acquires  skills  that  are 
inherer  t,  i^., determined  by  the  task  demands. 

Optimization  is  based  on  an  empirical  procedure  to  assess  transfer  of  training. 
Thus,  it  is  possible  for  a student  to  shift  from  one  strand  to  another  based  on 
performance  within  the  first  strand.  Control  of  the  process  is  guided  by  a model 
of  learning  that  provides  a complete  response  history  which  then  in  turn  feeds  a 
Markov  model.  The  optimization  procedures  are  based  on  empirical  results  of 
transfer  and  acquisition  using  simple  models  of  the  performance  during  learning. 
Trial  by  trial  selection  of  items  and  exercises  procedes  from  a determination  of 
each  student’s  ability  and  each  item’s  difficulty.  At  the  most  global  level, 
optimization  is  over  the  allocation  of  student  time  to  each  strand. 

In  these  procedures,  Atkinson  has  captured  the  crucial  questions  of  instruc- 
tional design  that  each  teacher  asks: 

1 . Wh;  items  and  exercises  should  1 give? 

2.  How  should  the  student’s  time  be  distributed  over  the  different  classes  of 
work? 

3.  How  should  the  resources  available  for  instruction  be  allocated  among  the 
students? 

Atkinson  shows  how  far  one  can  go  toward  answering  these  questions  with 
relatively  simple  models  of  performance  during  learning.  His  experiments  on  the 
optimum  sequencing  of  vocabulary  items  indicates  the  kind  of  gains  one  can 
expect  from  the  systematic  selection  and  presentation  of  items  for  study. 
Improvement  of  the  order  of  50  and  100%  are,  indeed,  impressive  results. 
Atkinson  points  out  that  comprehensive  theories  of  learning  are  not  necessary  to 
produce  important  differences  in  rate  and  degree  of  learning. 
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I view  Chapter  4 by  Atkinson  as  a landmark  for  CAI  application',.  In  it  we  see 
three  important  demonstrations  of  the  state  of  the  art  in  appropriate  applica- 
tions of  computer  based  instruction.  First,  the  Basic  In  actional  Program  (BIP) 
represents  an  appropriate  use  of  the  computer  as  a problem  solving  tool.  The 
learner  masters  a programming  language  under  the  direction  of  the  system  which 
assesses  his  or  her  level  of  problem-solving  skill  and  tutorially  guides  the  learner 
through  increasing  levels  of  skilled  performance.  It  has  been  several  years  since  I 
leveled  the  criticism  that  most  CAI  work  did  not  use  computers  in  ways  for 
which  they  were  intended  (Gregg,  1970).  Clearly  that  comment  cannot  be 
applied  to  Atkinson’s  work.  The  second  part  of  Atkinson’s  chapter  describes  his 
analysis  of  the  reading  task  and  it’s  acquisition.  Here  a sophisticated  analysis  of 
the  skills  required  in  reading  is  presented  and  a series  of  exercises  within  each 
strand  defined.  Although  on  a trial  by  trial  basis,  the  model  that  determines  the 
sequence  of  learner  experience  is  a powerful  application  of  decision  theoretic 
ideas  to  transfer  from  strand  to  strand  in  a complex  structure  based  on  task 
analysis.  In  tl  at  ‘arlier  criticism,  I said  that  it  seemed  a waste  to  use  computers 
as  glorified  memory  drums.  Atkinson’s  program  for  the  instruction  in  initial 
reading  bears  no  resemblance  to  so  stupid  a machine.  Perhaps  the  only  legitimate 
criticism  that  remains  is  that  certain  complex  cognitive  tasks  that  require  under- 
standing will  require  complex  cognitive  analyses.  In  the  chapters  hat  follow, 
models  of  semantic  memory,  sentence  comprehension,  and  understanding  writ- 
ten problem  instructions  point  the  way  for  these  now  developments. 

SUBJECT  STRATEGIES  AND  TASK  ANALYSIS 

Resnick’s  review  of  concepts  underlying  the  analysis  of  irstructable  components 
poses  several  challenges.  The  issue  is  how  to  combine  components  logically  to 
produce  the  desired  behavior  of  presumably  higher  complexity.  Instructional 
design  rests  on  the  premise  that  a sequence  of  component  tasks  can  be  identified 
and  then  mastered  in  some  order  to  produce  behavior.  The  integration  can  have 
logical  implications.  The  ideas  of  Osherson  (1974)  on  logical  grouping  capture 
the  hierarchical  nature  of  performance  combinatorics,  but  not  the  order  infor- 
mation that  I have  stressed  so  heavily  in  this  discussion. 

The  reason  I believe  serial  order  is  so  important  is  evident  when  we  consider 
that  a subject  strategy  consists  of  a sequence  of  cognitive  acts  involving  shifts  in 
attention  and  transformations  of  objects.  Thus,  in  any  problem-solving  task  or 
learning  situation,  a complete  description  of  the  psychological  problem  space 
must  include  the  representation  of  the  specific  objects  and  the  set  of  operators 
to  be  applied.  In  a later  chapter,  Simon  and  Hayes  discuss  problem  isomorphs 
where  superficially  distinct  problems  map  onto  the  same  problem  space.  Hence, 
differences  in  problem-solving  performance  can  be  attributed  to  a failure  to  find 


r t'iiT^^BfaiiaiiMaiTi  'frfrr ' ~ iVi  ir  : ~ ii-i  • ■ ...  ..  -i>. 


5.  METHODS  AND  MODELS  115 


an  efficient  representation  for  the  task.  But  here  I am  talking  about  performing 
an  identical  task  where  only  the  instructions  to  the  learner  vary. 

In  a recent,  unpublished  experiment  by  Gerritsen,  Gregg,  and  Simon,  the 
subjects  were  instructed  in  three  strategies.  All  other  conditions  of  the  expen 
men*  were  the  same.  The  strategies,  however,  caused  the  subjects  to  attend  to 
different  aspects  of  the  problem  as  it  was  being  solved.  Both  the  stimulus 
information  and  the  transformation  rules  were  different  for  each  strategy. 

The  experimental  task  was  similar  to  Restle’s  (1970)  serial  pattern  learning 
task.  The  problem  was  displayed  before  the  subject  consisting  of  a digit  from 
oii“  to  six  and  a series  of  3,  4,  or  5 letters  which  stood  for  the  symbolic 
operators:  T was  transpose,  add  one;  M was  mirror,  obtain  the  sevens  compli- 
ment: and  R was  repeat.  The  subjects  responded  by  Dressing  buttons  labeled 
with  the  values  one  to  six  on  a pane!  in  front  of  them. 

The  three  strategies  were  called  the  Doubling,  Recompute,  and  Pushdown 
Methods.  Each  specified  a different  information  processing  sequence  end  each 
resulted  in  quite  different  response  measures  and  error  patterns.  In  fac",  perfor- 
mance on  the  task  varied  by  a factor  of  3 to  1 , from  2244  msec  per  response  for 
the  Recompute  Method  to  721  msec  per  response  for  the  Doubling  Method. 
Such  robust  effects  from  instructions  alone  argue  that  any  attempt  to  identify- 
ing skill  components  must  be  certain  of  their  implementation  in  terms  other 
than  task  variables. 


CONCLUSIONS 

The  moral  of  this  research  story  is  simply  that  subject  strategies  are  crucial  to  an 
understanding  of  cognitive  performance.  One  way  of  viewing  the  entire  problem 
of  instructional  task  analysis  is  to  say  that  the  goal  is  to  specify  a complete  set  of 
subject  strategies  sufficient  to  the  task,  to  map  feasible  strategies  onto  the 
current  information  processing  skills  of  the  learner,  and  finally  to  develop 
instructional  methods  and  materials  such  that  the  learner  acquires  those  strate- 
gies that  have  the  greatest  educational  value. 

Resnick,  in  Chapter  3,  showed  that  children  will  inven  strategies  and  these 
may  be  more  efficient  than  those  derived  by  curriculum  d esigners.  Chapter  7 by 
Greeno  will  show  that  we  can  teach  better  or  worse  st:a*egies.  The  challenge  for 
instructional  design  is  clear. 
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My  purpose  in  this  chaptei  is  to  discuss  five  general  points  presented  in  chapters 
of  Carroll,  Atkinson,  Calfee,  and  Resnick  that  underlie  the  discussion  or  develop- 
ment of  a theory  of  instruction. 

First,  I want  to  indicate  my  opinion  as  to  the  current  status  of  the  enterprise.  1 
believe  that  a theory  of  instruction  is  a legitimate  scientific  goal  and  that  such  a 
theory  has  as  examplars  some  impressive  local  successes  (e.g.,  Atkinson’s  work). 
However,  it  is  easy  to  overestimate  its  current  status.  As  it  now  stands,  the 
theory  of  instruction  is  very  primitive  and  not  at  all  at  a stage  requiring  complex 
mathematical  descriptions  for  its  expression.  Considerations  of  the  nature  of 
performance,  the  nature  of  knowledge,  the  nature  of  the  communicable  and 
instructable,  and  the  nature  of  experience-some  of  which  have  been  raised  by 
Carroll  and  Resnick  as  well  as  by  others  (cf.  Olson  & Bruner,  1974)-are 
conceptual  issues  that  must  be  faced  in  the  attempt  to  construct  such  a theory. 

At  the  more  specific  level  of  optimal  design,  that  is,  research  directed  to  the 
production  of  an  explicit  instructional  program  designed  to  achieve  a particular 
educational  goal,  the  achievements  to  date  are  more  impressive.  Programs  de- 
signed to  achieve  a particular  goal,  such  as  those  described  by  Atkinson,  Resnick, 
and  Calfee  (Chapters  4,  3,  and  2,  respectively,  of  this  volume),  and  others  such 
as  Sesame  Street  (Palmer,  1975)  can  be  described  quite  precisely  and  achieve 
their  goals  quite  successfully. 

But  what  is  the  relation  between  specific  instructional  programs,  the  problem 
of  design,  and  a general  theory  of  instruction?  Atkinson  raised  this  point  in 
regard  to  the  “depth”  of  the  theory  based  on  the  set  of  parameters  found  to  be 
relevant  to  the  optimization  of  a particular  instructional  program.  The  ‘theory’ 
is,  as  I understood  him,  primarily  an  equation  optimizing  the  particular  set  of 
parameters  for  that  set  of  tasks.  The  problem  is  that  that  equation  would  not  be 
generalizable  to  a new,  quite  different,  set  of  tasks.  How  then  do  we  get  to  a 

Preceding  page  blank 


V - ' 


117 


118  DAVID  R.  OLSON 


general  theory  of  instruction  that  would  apply  to  the  vast  arrav  of  thin*.  . 

““f  >°  1>  ■«.  of  le.mm  by  , ™ Ly of  ZZZul.  SS  2 
general  theory  of  instruction  that  we  know  so  little 

J““d  “0<h"  ““  relation  between  panicul,,  procedures  and  Bnm, 

theory  may  I*  concaved  in  ten™  of  the  tele, ion  between ,**.  and ZeZ  t 
the  analysis  of  instruction  are  we  concerned  primarily  either  with  the  probleJn  of 
esigrung  effective  instructional  systems  (cf.  Richards,  1968,  1974)  or  with  the 
construction  of  a theory  that  would  make  explicit  the  nature  imd  colZl  of 

the  experiences  managed  by  formal  institutions  such  as  schools.  And  it  fc  difficult 
to  serve  these  two  goals  at  the  same  time.  is  Oitticult 

Second,  I want  to  consider  what  a “cognitive”  instructional  theory  would  look 

LllT  TP  y ,ad0Pt  “behaVi°ral  °bjeCtives”  °r  if  i"^ctioL  theory t 
concerned  only  with  optimization,  cognitive  theories  are,  to  say  the  least  as 

luxury.  What  could  a cognitive  theory  add?  As  Carroll  printed  ouT  , cation 
meins  knowledge.  A cognitive  theory  would  be  concerned  with  the  nature  of 
human  knowledge,  how  it  is  represented  and  how  it  is  acquired  But  in  instruc 

which""6  7 n°i!  COnC?med  With  311  knowIedge  but  rather  with  that  knowledge 
Which  is  of  such  social  value  that  the  society  creates  an  institution  to  ™te 

knoliT  “d  t0  CVery  Child  *rowin8  “P  i"  that  society.  TZ 
nowledLe  aid  what  are  ths  means  at  our  disposal  for  conveying  it? 

arroll  argued  that  a theory  of  instruction  that  takes  seriously  the  nature  and 
acquisition  of  knowledge  would  make  induction  more  effective.  Calfee  and 
Resnick  by  their  research  programs  show  that  they  agree.  And  they  may  be 

wdll  T8  W°U,d  h®  indist^shable  from  any  behaviorist  or 

r pneer-  Again,  then,  what  could  cognitive  psychology  add’ 

ont.W  7 P?Ch0l°f  COuld  be  concerned,  not  with  prediction  and  control  and 
froTCp";  <7?  “understanding.”  Objective  knowledge,  borrowing 
m lopper  is  a record  of  the  culture’s  solutions  of  imnortant 

fh  Tfh  SUCh’  kn°Wledge  comPlements  the  adaptive  resources  given  to  us 

Ss^  7 HgCneS  3nd  3CqUired  own  personal  expenelces 

is  knowledge  is  coded  in  terms  of  the  symbol  systems  of  the  culture 

^tme  TnyconnttUrf  ^ kn°wled8e  is  shared  “d  therefore  social  in 

dfsUnctiie  from"  h37  ” ""  and  in  social  significance  this  knowledge  is 

nrTT  , , 3CqUired  ™ direct  Poetical  activity;  presumably  the 
processes  involved  in  the  extraction  of  such  knowledge  are  distinctive  and  the 
consequences  of  the  acquisition  of  this  knowledge  are  unique  A theory  of 
instruction,  therefore,  is  needed  for  more  than  simply  irnprZ.ii  tZ  o 

neer to  heip  make  expiidt  **  ^ ~ X £ 

lldren  and  adults  in  the  process  of  socializing  them.  Then  we  will  be  in  a 
' °f  thCSe  6ffeCtS  316  WOrthwhil®  aS  WeU  as  -hich  means 

To  summarize  this  point:  instructional  theory  should  be  concerned  not  only 
wi  optimization,  control,  or  the  achievement  of  behavioral  objectives  but  also^ 
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and  more  basically,  with  making  explicit  the  nature  and  the  consequences  of  the 
transmission  of  socially  useful  knowledge.  Once  these  processes  are  explicit  they 
are  subject  to  rational  consideration,  as  Popper  (1972)  has  argued.  So  we  require 
a theory  of  instruction  that  would  cast  into  theoretical  terms  what  is  already 
going  on  in  the  schools,  and  elsewhere,  in  the  name  of  instruction  and  whit  is 
being  altered  with  the  introduction  of  new  types  of  programs  whether  they  be 
activity  programs,  or  CAI,  or  whatever. 

Third,  I would  like  to  consider  instruction  as  t ie  communication  of  explicit, 
formalizable  routines.  This  point  is  related  to  Resnick’s  interesting  suggestion 
that  good  instruction  ma'7  trade  off  communicability  with  formal  adequacy.  I 
recently  published  an  article  titled  “What  is  worth  knowing  and  what  can  be 
taught?  (Olson,  1 973)’ ' in  which  I argued  that  much  of  the  knowledge  most 
worth  having-making  discoveries,  s,  .iking  convincingly,  writing  effectively,  and 
various  social  and  e'hical  skills— cannot  be  taught  explicitly  because  the  algo- 
rithm underlying  them  (if  indeed  there  are  such  algorithms)  are  not  known. 
Many  that  are  known  are  too  complex  to  communicate  easily  (consider,  for 
example  the  passive  transformation  in  English).  Yet,  in  another  sense,  these 
important  skills  may  be  ‘taught’  by  providing  demonstrations  and  by  providing 
sessions  for  repeated  practice  accompanied  by  appropriate  feedback. 

To  summarize  this  point,  some  knowledge  is  formalizable— an  algorithm  for 
adding,  for  example,  and  can  be  taught  explicitly.  But  other,  more  complex 
skills,  including  many  socially  valued  skills,  cannot  be  taught  explicitly  and  they 
may  have  to  be  ‘taught’  through  demonstrations  or  modeling  and/or  through 
making  allowances  for  learning  through  trial  and  error  or  muddling.  And  a 
theory  of  instruction  is  going  to  have  to  specify  the  nature  and  the  role  and  the 
consequences  of  modeling  and  muddling  as  well  as  the  more  explicit  intervention 
which,  for  the  sake  of  alliteration,  we  may  call  meddling. 

Fourth,  consider  the  relation  between  knowledge  and  skill  or  knowledge  and 
performance.  Carroll  raised  this  issue  in  regard  to  the  linguist’s  distinction 
between  competence  and  performance.  Cognitive  psychology  contrasts  with 
behaviorist  theory  in  assuming  that  knowledge  can  be  wrested  from  the  purpose 
for  which  it  was  acquired  and  the  context  in  which  it  was  acquired  and  cast  into 
more  general  symbolic  form,  thus  rendering  that  knowledge  applicable  to  a 
much  wider  range  of  problems. 

Schools  clearly  got  carried  away  with  the  assumption  that  the  acquisition  of 
theoretical  knowledge , what  Ryle  (1949)  called  propositional  knowledge,  was  the 
sine  qua  non  of  education,  perhaps  giving  some  legitimacy  to  student’s  current 
protest  against  the  value  of  being  stuffed  with  “irrelevan  t”  knowledge. 

This  issue  hangs  critically  upon  an  understanding  of  how  knowledge  relates  to 
performance.  Friere’s  book  The  Pedagogy  of  the  Oppressed  (1972)  makes  the 
case  that  the  only  liberating  knowledge  is  knowle  dge  acquired  by  praxis— action 
coupled  with  reflection.  Perhaps  useful  knowledge  must  he  acquired  in  the 
context  of  action.  Minimally,  a cognitive  theory  of  instruction  must  indicate 
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how  knowledge  is  related  to  performance  both  in  its  acquisition  and  its  subse- 
quent use. 

Fifth  and  finally,  I want  to  argue  in  a preliminary  way  that  the  means  of 
instruction  are  not  simply  instrumentalities  in  instruction.  This  point  is  based  on 
the  concept  of  the  bias  of  communication  (McLuhan,  1964;  Innis,  1951)  The 
way,  the  means  by  which  one  is  taught,  biases  what  is  learned  in  a way  that  has 
largely  escaped  detection.  I have  recently  argued,  (Olson,  1974)  following  the 
leads  of  Havelock  (1974),  Goody  and  Watt  (1968),  Bruner,  (1966)  and  others, 
that  the  particular  reliance  in  our  culture  on  the  use  of  language  out  of  the 
context  of  practical  action  has  put  a distinctive  mark  on  both  our  cultural 
patterns  and  our  cognitive  processes.  Specifically,  the  use  of  written  prose  as  a 
means  of  mstruction  recruits  and  develops  a set  of  mental  competencies  that  are 
general  to  a wide  range  of  intelligent  performances.  To  illustrate,  the  fact  that 
we  learn  Chemistry  from  textbooks  which  utilize  the  peculiar  language  of  explicit 
written  prose  results  not  only  in  some  knowledge  of  chemistry  but  also  of  literary 
skills  of  a high  level.  These  sldUs-the  ability  to  see  the  logical  implications 
of  wmte.i  statements,  and  more  importantly,  the  ability  to  formulate  general 
statemeits  from  which  true  inferences  can  be  drawn-are  mental  skills  of  great 
importar  ce  and  generality.  But  because  they  are  a specialized  set  of  skills,  they  are 
appropriate  only  for  some  kinds  of  tasks;  and  they  may  lead  us  to  undervalue  other, 
equally  important  but  different  skills  such  as  those  involved  in  the  arts  or,  for  that 
matter,  those  involved  in  common-sense  judgments.  Instructional  theory  will  have 
to  account  for  the  nature  of  means  before  it  can  be  regarded  as  a general  theory. 

It  is  this  somewhat  vague,  general,  and  preliminary  level  of  conceptual  analysis 
that,  I believe,  will  yield  new  understanding  of  ihe  nature  of  instruction. 


Part  II 

PROCESS  AND  STRUCTURE 

IN  LEARNING 


As  the  anatomist,  with  his  microscopical 
study  of  the  stomach,  may  finally  suggest 
the  ways  for  cooking  more  digestible  food, 
so  the  experimental  psychologist  will  com- 
bine and  connect  the  detailed  results  more 
and  more,  till  he  is  able  to  transform  his 
knowledge  into  practical  educational  sug- 
gestions. . . . Single  disconnected  details  are 
of  no  value  for  such  a practical  transforma- 
tion; and  even  after  all  is  done,  this  more 
highly  developed  knowledge  will  be  but  a 
more  refined  understanding  of  qualitative 
relations- never  the  quantitative  measure- 
ment which  so  many  teachers  now  hopefully 
expect  [Miinsterberg,  1898J . 
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Cognitive  Objectives 
of  Instruction:  Theory  of 
Knowledge  for  Solving  Problems 
and  Answering  Questions 

James  G.  Greeno 

The  University  of  Michigan 

A great  deal  of  progress  has  been  made  in  recent  years  toward  the  understanding 
of  many  cognitive  processes.  Psychological  theories  that  have  been  developed 
and  tested  deal  with  perception,  memory,  thnking,  and  language  processing  at  a 
level  of  detail  and  specificity  ‘hat  is  an  order  of  magnitude  beyond  the  theoreti- 
cal concepts  available  only  a few  years  ago. 

My  purpose  in  this  chapter  is  to  show  how  some  of  this  body  of  theory  can  be 
used  in  the  formulation  of  instructional  objectives.  The  motivation  for  this  is 
quite  simple.  The  goal  of  instruction  is  that  students  should  acquire  knowledge 
and  skills  of  various  kinds.  A rich  set  of  conceit  s has  been  developed  in  scientific 
psychology  that  can  be  applied  to  analyze  the  structure  of  knowledge  and 
cognitive  skills.  Thus,  it  should  be  possible  using  those  concepts  to  carry  out 
analyses  of  the  knowledge  and  skill  that  are  desired  as  outcomes  of  instruction. 
It  may  be  expected  that  the  explicit  statement  of  instructional  objectives  based 
on  psychological  theory  should  have  beneficial  effects  both  in  desigr  of  instruc- 
tion and  assessment  of  student  achievement.  The  reason  is  simple:  we  can 
generally  do  a better  job  of  accomplishing  something  and  determining  how  well 
we  have  accomplished  it  when  we  have  a better  understanding  of  what  it  is  we 
are  trying  to  accomplish. 

The  view  I am  taking  has  much  in  common  v/ith  the  opinions  of  many 
educational  psychologists  (such  as  Anderson  & Faust,  1973)  who  recommend 
that  instructional  goals  be  formulated  as  behavioral  objectives.  In  the  view  taken 
here,  development  of  instructional  objectives  begins  with  consideration  of  the 
kinds  of  tests  used  to  assess  whether  students  have  acquired  the  knowledge 
intended  as  the  outcome  of  learning.  But  rather  than  just  specifying  the  behav- 
iors needed  to  succeed  on  such  tests,  cognitive  objectives  are  developed  by 
analyzing  the  psychological  processes  and  structures  that  are  sufficient  to 
produce  the  needed  behaviors. 

Preceding  page  blank 
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There  is  an  important  psychological  assumption  implicit  in  the  position  taken 
here.  I am  assuming  that  the  goals  of  instruction,  including  aspects  of  conceptual 
understanding,  can  be  inferred  from  the  tasks  that  students  are  expected  to 
perform  during  instruction  and,  following  instruction,  on  tests.  If  this  is  ac- 
cepted, then  it  follows  that  a theory  specifying  cognitive  structures  and  pro- 
cesses sufficient  to  perform  those  tasks  is  a candidate  hypothesis  about  what  the 
instruction  is  trying  to  produce.  Of  course,  any  candidate  that  is  proposed  can 
and  should  be  questioned  regarding  issues  of  substance.  I am  confident  that  the 
specific  features  of  the  objectives  I will  present  here  can  be  improved,  although  I 
have  tried  to  incorporate  reasonable  psycholo  jical  assumptions  into  these  illus- 
trative casts.  However,  the  general  kind  of  description  offered  here  should  be 
taken  as  a completely  serious  proposal  about  what  the  goals  of  instruction  are 
like.  It  may  be  that  when  we  see  what  kinds  of  cognitive  structures  are  needed  to 
perform  criterion  tasks,  we  will  conclude  that  something  important  is  missing; 
but  if  that  is  the  case,  it  also  will  be  important  to  identify  a more  adequate  set  of 
criterion  tasks  in  order  to  ensure  that  instruction  is  promoting  the  structures  we 
think  are  important. 

I have  chosen  three  substantive  domains  in  which  to  develop  illustrative 
cognitive  objectives  of  instruction.  The  first  is  elementary  material  in  fourth- 
grade  fractions;  the  second  is  introductory  material  in  high-school  geometry;  and 
the  third  is  some  material  from  introductory  college  psychology  dealing  with 
auditory  psychophysics.  None  of  these  is  developed  to  anything  near  a complete 
and  detailed  set  of  objectives;  however,  I hoj'e  that  I have  developed  a suffi- 
ciently specific  example  in  each  case  to  make  the  enterprise  credible.  One  reason 
for  choosing  these  three  examples  is  that  they  r present  instruction  carried  out 
at  widely  different  age  levels.  I believe  that  ou  current  stock  of  concepts  and 
techniques  in  cognitive  psychology  is  adequate  1 3 the  task  of  analyzing  instruc- 
tional objectives  from  elementary  school  through  college,  and  my  choice  of 
examples  is  meant  to  back  up  that  belief. 

A related  point  about  the  choice  of  examples  is  that  they  illustrate  some 
important  broad  relationships  between  knowledge  that  is  imparted  to  students 
of  different  ages.  The  knowledge  needed  to  do  computations  with  fractions 
seems  to  involve  a simple  kind  of  algorithmic  skill  that  can  be  expressed  easily 
with  flow  charts.  Suppes  and  Momingstar  (1972)  developed  similar  models  for 
analysis  of  addition,  subtraction,  and  multiplication;  Suppes  (1969)  has  called 
these  automaton  models.  Tasks  used  for  instruction  in  high-school  geometry 
require  a more  complicated  set  of  procedures  and  knowledge  structures.  Knowl- 
edge that  is  required  for  geometry  can  be  represented  as  a production  system, 
including  mechanisms  that  are  found  in  current  theories  of  problem  solving  (e.g., 
Newell  & Simon,  1972)  for  setting  goals  and  searching  in  a problem  space.  The 
problem-solving  system  uses  numerous  procedures  of  the  kind  taught  to  elemen- 
tary-school students,  so  knowledge  of  the  first  kind  is  embedded  in  the  more 
complex  structures  required  for  the  more  mature  learning. 
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The  instructional  objectives  for  college  psychology  seem  to  require  still  an- 
other level  of  complexity.  Understanding  auditory  psychophysics  requires  ac- 
quisition of  a complex  network  of  concepts  of  the  kind  we  are  familiar  with  in 
theories  of  semantic  memory  (Anderson  & Bower,  1973;  Kintsch,  1974;  Nor- 
man, Rumelhart,  & Group,  1975;  Quillian,  1968)  and  performance  on  many 
criterion  tasks  (such  as  essay  examinations)  requires  a procedure  for  generating 
paragraphs  in  answer  to  complex  questions.  We  are  just  now  beginning  to 
explore  the  kinds  of  cognitive  capabilities  needed  to  produce  structured  verbal 
output  at  the  level  of  paragraphs  (e.g.,  Abelson,  1973;  Crothers,  1972;  Frederik- 
sen,  1972;  Rumelhart,  1975).  The  mechanisms  of  generating  explanations 
apparently  share  significant  features  with  mechanisms  of  generating  solutions  for 
problems,  but  there  are  also  significant  differences,  due  at  least  in  part  to  the 
more  open-ended  quality  of  the  task. 


EXAMPLE  1:  FRACTIONS1 

Much  of  the  work  on  fractions  required  of  students  involves  carrying  out 
calculations  such  as  finding  equivalent  fractions,  adding  and  subtracting  frac- 
tional numbers,  and  finding  common  denominators.  Ability  to  carry  out  those 
calculations  is  a minimal  objective  of  instruction,  and  it  can  be  represented  in  a 
psychological  theory  as  a flow  chart  showing  the  component  processes  of  the 
procedure.  In  general,  the  procedure  is  not  unique-there  are  more  ways  than 
one  to  calculate  the  correct  answer.  Alternative  procedures  can  be  represented  in 
different  models,  or  incorporated  in  a single  nondeterministic  model  that  allows 
different  branches  to  be  taken. 

Procedural  Representation  of  a Concept 

Figure  1 shows  a procedure  for  adding  two  fractions.  The  upper  part  of  the 
diagram  shows  a procedure  for  finding  fractions  with  a common  denominator 
that  are  equivalent  to  the  numbers  given  in  the  problem.  The  lower  part  sketches 
the  operation  of  checking  the  answer  and  reducing  if  necessary. 

If  the  procedure  shown  in  Fig.  1 is  accepted  as  a psychological  model  of  adding 
fractions,  then  it  is  a candidate  for  an  instructional  objective  in  the  elementary 
mathematics  curriculum.  The  concept  of  adding  fractions  is  a procedure,  and  a 
goal  of  instruction  is  to  have  students  acquire  that  procedure  as  part  of  their 
cognitive  equipment.  The  idea  of  representing  a concept  as  a cognitive  procedure 

1 1 have  been  privileged  to  participate  in  a number  of  discussions  of  children’s  learning  of 
fractions  with  my  colleague  Joseph  Payne  and  his  students.  Many  of  the  opinions  and 
judgments  that  I have  about  fractions  have  been  developed  in  those  discussions,  although 
Payne  and  his  group  surely  should  not  be  held  responsible  for  faults  in  my  understanding. 
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FIG.  1 Procedure  for  addition  of  fractions. 

is  familiar  in  recent  theoretical  work.  Examples  include  Hunt’s  (1962)  analysis 
of  categorical  concepts  as  procedures  for  classifying  stimuli,  and  Winograd’s 
(1972)  theory  of  language  understanding,  where  concepts  are  procedures  for 
identifying  objects  and  answering  questions  about  their  locations. 


Cognitive  Representations  of  Quantity 

Procedures  like  Fig.  1 can  compute  answers,  but  they  lack  conceptual  under- 
standing of  a kind  that  many  educators  would  want  students  to  acquire.  Texts 
such  as  the  one  by  Payne,  May,  Beatty,  Weils,  Spooner,  and  Dominy  (1972) 
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include  numerous  exercises  like  the  one  shown  in  Fig.  2,  for  which  the  student  is 

to  fill  in  the  blanks  of" out  of pieces  are  shaded.”  Other  exercises  present 

sets  of  discrete  objects,  such  as  a row  of  circles,  some  of  which  are  colored 

differently  from  the  others,  with  a question  “ out  of  circles 

are  red.”  The  intent  is  for  students  to  begin  by  seeing  fractional  quantities 
represented  pictorially,  as  parts  of  regions  or  as  subsets. 

An  important  issue  in  the  theory  of  mental  computation  is  the  way  in  which 
quantitative  information  is  represented.  A procedure  like  Fig.  1 is  neutral  with 
regard  to  the  representation  of  quantity  — that  is,  quantities  could  be  repre- 
sented in  a variety  of  ways  and  the  procedure  could  be  designed  to  work  on  any 
of  them.  I suggest  that  the  instructional  objective  reflected  in  exercises  like  Fig. 
1 can  be  represented  in  a theory  about  the  ways  in  which  fractional  quantities 
are  represented. 

I will  distinguish  here  among  three  representations  of  quantity;  there  probably 
are  more,  but  these  seem  to  be  the  main  possibilities  involved  in  elementary 
instruction.  The  first  representation  is  just  an  ordered  alphabet  of  numerals. 
Students  can  count,  and  the  list  produced  by  counting  provides  a precedence 
relation  on  the  numbers.  Basic  operations  of  addition,  subtraction,  multiplica- 
tion, and  division  may  also  be  stored  as  relations  on  numbers-for  example,  “five 
times  three  equals  15,”  may  be  in  cognitive  structure  as  a sequence  of  verbal 
associations. 

A second  representation  involves  actual  or  imagined  quantity,  the  number  of 
items  in  a set.  A system  could  be  designed  with  a counting  mechanism  for 
assigning  a number  to  any  set,  but  with  operations  of  addition,  multiplication, 
and  so  on  carried  out  on  sets  rather  than  directly  on  numbers.  For  example,  to 
find  5X3,  imagine  a set  of  five  objects,  then  imagine  three  of  those  sets,  and 
count  the  total  number  of  objects  in  the  three  sets. 

A third  representation  uses  geometric  forms,  and  quartity  is  represented  as  the 
spatial  extent  of  a form.  Addition  and  subtraction  can  be  represented  as  moving 
to  the  right  or  left  on  a number  line.  To  multiply  5X3,  imagine  a rectangle 
divided  vertically  into  five  sections,  where  the  measure  of  each  section  is  taken 
to  be  one.  Then  the  rectangle  is  nride  thiee  times  as  large-imagine  two  more 
rectangles  just  below  the  first  one  with  the  three  rectangles  concatenated, 
forming  a single  large  rectangle.  The  total  measure  of  the  large  rectangle  is  15,  as 


FIG.  2 Diagram  representing  a fraction  spatially 


128 


JAMES  G.  GREENO 


could  be  confirmed  by  counting  all  the  sections  of  size  equal  to  the  original 
sections. 

It  would  not  be  realistic  to  suppose  that  arithmetic  operations  are  generally 
carried  out  by  anyone  as  operations  on  sets  or  regions.  No  one  multiplies  9X7 
by  imagining  nine  objects,  each  reproduced  seven  times,  and  counting  the  total; 
we  remember  that  9 X 7 is  63.  However,  teachers  and  writers  of  texts  apparently 
feel  that  it  is  useful  to  introduce  procedures  for  manipulating  quantity  as 
operations  on  spatial  representations  or  on  representations  of  sets.  If  Piaget 
(1965)  is  correct,  addition  and  multiplication  of  numbers  depend  on  the  same 
basic  cognitive  operations  as  additive  and  multiplicative  combinations  of  sets 
(and,  we  might  suppose,  regions— though  that  may  involve  some  additional 
sophistication  about  space  and  geometry;  see  Piaget,  Inheldrr  & Szeminska, 
1960).  With  or  without  Piaget’s  theory,  it  is  reasonable  to  as  ’i*  that  students’ 
acquisition  of  basic  arithmetic  concepts  is  aided  by  connecting  those  concepts 
with  operations  on  sets  and  spatial  quantity , since  they  have  observed  many  of 
those  changes  in  their  experience. 

When  fractions  are  introduced,  they  can  be  related  either  to  diagrams  showing 
geometric  shapes  divided  into  pieces,  or  to  diagrams  showing  sets  of  objects  of 
different  kinds.  Fractions  can  also  be  defined  for  the  students  as  a combination 
of  operations  including  a multiplication  and  a division  (Dienes,  1967b),  although 
this  would  generally  be  done  in  connection  with  diagrams  involving  sets  or 
regions.  1 will  not  try  to  present  a full  analysis  here  of  all  the  relative  advantages 
of  'ariovs  ways  of  presenting  fractions.  My  goal  will  be  to  show  that  the 
cognitive  representation  of  quantity  can  play  an  important  role  in  procedural 
representations  of  mathematical  concepts.  I believe  that  the  formulation  of 
psychological  theories  involving  different  representations  gives  some  new  clarity 
to  the  issues  involved  in  choosing  a way  to  present  mathematical  concepts,  and  I 
will  illustrate  this  with  some  discussion  of  the  issue  regarding  fractions.  However, 
this  discussion  should  be  seen  as  an  illustration  of  a way  in  which  cognitive 
objectives  can  be  used  in  discussion  of  alternative  instructional  methods,  rather 
than  firm  advocacy  of  a particular  method. 

Alternative  Objectives  with  Differing  Representations 

The  plan  of  the  rest  of  this  section  is  as  follows.  I will  present  three  models  that 
find  equivalent  fractions  when  the  denominator  of  the  fraction  to  be  computed 
is  a factor  or  a multiple  of  the  fraction  given.  One  of  the  models  incorporates  a 
process  of  generating  a unit  region  and  operates  on  that  region  by  forming 
subregions  eithei  by  subdividing  or  collecting  pieces  of  the  region.  A second 
model  uses  a process  of  generating  sets,  forming  subsets  by  partitioning  the  set 
and  generating  members.  The  third  model  is  a simple  algorithm  for  computing 
equivalent  fractions  using  operations  of  multiplication  and  division  defined  on 
numbers.  The  reason  for  developing  these  models  is  to  show  how  theoretical 
analysis  of  a task  can  prov  en’  specific  psychological  characterizations  of  alterna- 
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FIG  3 Procedure  for  finding  equivalent  fractions,  using  spalial  represenlalion  of  fraclional 
quanlily 


live  instructional  goals.  I will  also  give  some  tentative  discussion  regarding 
implications  of  the  different  models  for  acquiring  other  concepts  related  to 
equivalent  fractions. 

A model  that  uses  spatial  processing  of  a region  is  shown  in  Fig.  3,  and  rig. 
shows  traces  of  the  program  as  it  solves  two  problems.  This  model  and  the  ones 
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5 ' 15 


6 _ x 
15  ' 5 


FIG.  4 Traces  of  the  procedure  for  equivalent  fractions  using  spatial  representation. 


that  follow  assume  that  the  problem  has  been  formulated  so  that  ore  denomina- 

not  sa^dPth°  0thCr’  3nd  lHe  anSWer  is  “ inte8er-  If  these  conditions  are 
Fiil  3 and  4 X P/"CeSSeS /eturn  Mure-  The  representation  of  fractions  in 
mfi'nt  * ! 6 °T  Sh°Wn  in  Fi«-  2> 10(1  a reasonable  curriculum  would 

use  introductory  exercises  like  Fig.  2 as  preparation  for  learning  the  structure  of 

Figure  5 shows  a model  of  finding  equivalent  fractions  using  a process  that 
includes  a mechanism  for  generating  sets,  members  of  sets,  and  subsets.  A trace 
ot  die  program  s solution  of  two  problems  is  shown  in  Fig.  6. 

A third  model  of  finding  equivalent  fractions  is  given  in  Fig.  7.  This  represents 
a unple  algorithm  for  finding  the  correct  answer,  without  the  involvement  of 
any  imagery  or  diagrams. 
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HG.  8 Trace,  of  the  procedure  for  equivalent  fractions  using  set-theore.ic  representation. 


SOmC  fCatUreS  0f  Process*ng  assumed  in  these  mod 
are  based  on  my  mtuiUon  rather  than  on  any  data  that  I am  aware  o7f 

example,  the  first  step  in  the  problem  6/15  = XI S in  Fig.  5 when  it  noted  th 
c < b is  to  imagine  (or  draw)  a set  having  c subsets  as  elements.  In  effect  I ha 
assumed  that  a subject  generates  abstract  place-holders  that  wdl  becom  « 
when  dements  are  generated.  There  are  alternative  models  thaTare^so  p,au 

F°r  exfmple>  one  might  assume  that  initially  a set  with  b elements 
generated;  then  they  are  grouped  into  subsets  having  M = b/c  elements  in  eat 
“teli  to.oftte  subsets  „ Jkedt  ItaXT,!  f„  * 

Sho«™„  Fig.  5 rather  dun  the  tnode,  sketched  .bTvI'ist  ‘ 
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Two  comments  should  be  made  about  these  arbitrary  features  of  processing 
that  are  incorporated  in  theories  presented  here.  First,  in  many  cases  experi- 
mental tests  of  the  models  could  be  developed  to  distinguish  between  alternative 
models,  if  it  were  considered  important  to  distinguish  between  them  empirically. 
Second,  it  may  not  be  critical  to  distinguish  between  models  differing  in 
processing  details  if  the  details  lack  important  implications  for  quality  of  student 
performance  in  instructional  situations,  or  the  ability  of  students  to  progress  to 
further  stages  of  knowledge  and  understanding.  The  variations  that  I have 
thought  of  within  the  three  classes  of  models  that  1 have  presented  seem 
relatively  unimportant  to  me  as  regards  instructional  implications. 

However,  1 think  the  differences  among  the  three  models  diagrammed  in  Figs. 
3,  5,  and  7 probably  are  significant  in  connection  with  students'  ability  to  use 
concepts  of  fractional  quantity  in  later  learning  and  in  situations  that  arise  in 
experience.  It  is  a reasonable  hypothesis  that  procedures  like  those  of  Figs.  3 and 
S are  important  in  applying  fractions  in  situations  that  arise  in  experience.  The 
argument  is  as  follows:  in  situations  involving  continuous  quantity  (such  as 


start:  Identify 

a,  b,  c,  x in  j~  = | 


compute 
M = c * b 


compute 


Is  M arfy2£_ 
Jntegerj/ 


= M x a 


FIG.  7 Procedure  for  finding  equivalent  iractions,  operating  directly  on  numencal 
representations. 
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fractions  of  cups  of  su  stances  used  in  cooking)  and  in  situations  involving  sets 
of  discrete  units  (such  is  fractions  of  individuals  in  a group  who  favor  a certain 
action)  there  is  quantitative  information.  It  is  likely  that  a person  will  have  a 
cognitive  representati  m of  this  information  in  the  form  of  a spatial  representa- 
tion of  continuous  quantities,  or  a set-theoretic  representation  of  discrete  units.2 
Then  procedures  that  can  use  those  representations  directly  are  more  likely  to  be 
applied  than  procedures  that  require  translation  from  those  representations  to 
numerical  representations,  such  as  Fig.  7 would. 

The  other  consideration  involves  acquisition  of  further  structures  involving 
fractional  quantities.  There  are  reasons  to  expect  that  the  spatial  representation 
involved  in  Fig.  3 may  provide  a better  basis  for  understanding  addition  and 
subtraction  of  fractions  than  the  operations  on  sets  involved  in  Fig.  5 Note  that 
if  Fig.  3,  the  two  equivalent  fractions  (such  as  f and  6/is)  are  represented  in 
the  spatial  domain  as  equal  quantities.  In  the  set-theoretic  representation  a 
change  in  the  denominator  involves  a change  in  the  number  of  elements  in  the 
set;  this  means  that  the  relationship  that  is  preserved  between  equivalent  frac- 
tions does  not  correspond  to  an  invariant  quantity. 

Now  consider  adding  two  fractions  such  as  f+ J-.  It  clearly  is  possible  to  do 
that  in  either  representation,  but  it  seems  more  natural  in  the  spatial  representa- 
tion where  changes  to  common  denominators  do  not  require  changing  the  spatial 
aze  of  the  unit.  In  the  set-theoretic  representation,  fractions  with  different 
denominators  have  different  numbers  of  elements  in  the  total  sets-for  example 
r involves  a set  with  five  members,  while  6/,s  involves  a set  with  15  members’ 
Thus,  it  might  be  expected  that  wh<  n students  have  learned  to  think  about 
fractional  quantities  as  parts  of  . region,,  they  might  more  easily  learn  addition 
and  subtraction  of  fractions  than  if  they  learned  to  think  of  fractional  quantities 
as  subsets.  This  expectation  has  some  support  in  a study  by  Coburn  (1973),  who 
compared  two  instructional  sequences  for  introducing  fractions.  In  one  condi- 
tion, fractions  were  introduced  with  diagrams  of  regions,  with  a fractional 
quantity  corresponding  to  the  part  of  the  whole  region  shaded  or  marked  in 
some  way.  The  other  condition  involved  presenting  fractions  as  ratios,  primarily 
of  numbers  of  objects  in  different  sets  (such  as  the  ratio  of  squares  to  circles, 
where  there  might  be  five  squares  and  eight  circles).  Both  sequences  included  a 
unit  on  addition  and  subtraction  of  fractions,  although  the  ratio  group  required 
some  instruction  on  part-whole  relations  prior  to  learning  about  adding  and 
subtracting  fractions  The  two  groups  did  equally  well  in  addition  and  subti  ac- 
tion of  fraction*;  oth  equal  denominators  and  on  other  general  problems 
involving  fractions.  However,  in  addition  and  subtraction  of  fractions  having 
unequal  denominators,  there  was  a substantial  advantage  favoring  the  group 


’There  is  evidence  thal  persons  represenl  qualitative  information  in  spatial  images,  even 
when  lhe  information  is  presenled  verbally;  see  DeSoto,  London,  and  Handel  1965' 
Hutlenlocher,  1968;Potts,  1972.  ’ 
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receiving  the  introductory  material  based  on  regions  rather  than  ratios  of  the 
numbers  of  members  in  sets. 

The  hypotheses  enveloped  here  about  alternative  representations  and  addition 
or  subtraction  of  fraction*  ?iso  seem  to  favor  the  spatial  model  regarding  transfer 
to  other  topics,  such  as  decimals.  Representation  of  fractions  as  subsets  implies 
that  equivalent  fractions  are  equivalent  relations  between  quantities  that  are  not 
equivalent,  and  when  applications  involve  equivalent  quantities  there  may  be  a 
conceptual  difficulty  produced  by  the  set-theoretic  representation.  On  the  other 
hand,  numerous  concepts  and  applications  involving  fractions  apparently  call  for 
understanding  of  the  kind  of  invariant  relation  involved  in  the  set-theoretic 
representation.  This  seems  to  be  the  case  for  multiplication  and  division  of 
fractions,  and  for  many  applications  involving  percentages  (cf.  Begle,  1967). 

It  seems  surely  to  be  the  case  that  the  desired  outcome  of  instruction  included 
both  the  models  shown  as  Figs.  3 and  5,  and  the  model  of  Fig.  7 as  well,  since 
that  provides  for  efficient  computation.  There  are  important  psychological  and 
pedagogical  questions  regarding  relations  among  different  representations  of 
quantity.  I will  not  try  to  develop  an  analysis  of  those  relationships  here.  It 
would  have  to  be  largely  speculative  at  this  stage  of  our  knowledge,  but 
considerable  attention  is  being  given  in  current  research  to  problems  that  should 
provide  substantial  clarification  of  this  issue. 

Conclusion 

The  examples  worked  out  here  for  fractions  have  the  feature  of  all  task  analyses 
in  showing  in  some  detail  what  it  is  that  students  must  do  in  order  to  perform 
successfully  on  exercises  and  tests.  The  knowledge  needed  c<  rresponds  to 
procedures  for  carrying  out  computations,  and  at  least  in  the  present  treatment, 
concepts  such  as  addition  of  fractions  and  equivalent  fractions  are  a form  of 
procedural  knowledge.  The  procedures  can  be  defined  on  different  representa- 
tions of  fractional  quantity,  and  alternative  models  of  the  concept  of  equivalent 
fractions  were  presented,  based  on  representations  involving  spatial  extent, 
numbers  of  elements  in  sets,  and  simple  numerical  representation.  Implications 
of  the  differences  among  the  models  were  suggested. 


EXAMPLE  2:  EUCLIDEAN  U OMETRY3 

As  students  progress  in  mathematics  training  they  are  expected  to  carry  out 
tasks  that  are  more  complex  and  require  greater  skill  in  solving  problems.  Plane 

M am  grateful  for  the  assistance  of  John  Greeno  and  Katie  Greeno  who  provided 
thinking-aloud  protocols  of  their  solutions  of  problems  in  geometry.  We  worked  through  the 
section  of  parallel  lines  in  a text  by  Clarke  (1971),  intended  for  preparation  of  students  for 
the  British  ordinary-level  examinations. 


136  JAMES  G.GREENO 


FIG.  8 Diagram  for  a problem  in  angles  and  parallel  lines. 


geometry,  taught  in  the  ninth  or  tenth  grade,  requires  sophistication  in  problem- 
solving procedures  that  is  qualitatively  different  from  that  represented  in  the 
instructional  goals  for  fractions. 

The  major  new  requirements  involve  mechanisms  for  creating  goals  as  part  of 
the  process  of  solving  problems.  In  most  exercises  in  elementary  arithmetic, 
some  numbers  are  presented  and  a procedure  is  specified-for  example,  “Add  j+ 
In  many  exercises  given  in  high  school  geometry,  a situation  is  presented  and 
a goal  is  specified,  <ud  the  student  is  required  to  supply  a set  of  procedures  for 
achieving  the  goal.  In  order  to  understand  what  students  must  know  in  order  to 
succeed  on  problems  of  this  kind,  we  need  to  use  concepts  taken  from  the 
theory  of  problem  solving,  where  goal-directed  search  mechanisms  have  been 
analyzed  (Newell  & Simon,  1972). 

Consider  the  diagram  shown  in  Fig.  8;  the  question  is,  “Given  that  P = 30°, 
find  Q."  This  kind  of  problem  is  given  is  a geometry  course  when  students  have 
studies  parallel  lines  with  transversals,  and  before  they  have  the  theorem  that 
opposite  angles  in  a parallelogram  are  equal.  Solution  requires  relating  angle  Q to 
some  other  angle  in  a diagram  which  in  turn  in  related  to  angle  P,  or  finding 
some  longer  chain  of  angles  related  to  each  other.  A solution  found  by  one 
subject  uses  the  angles  marked  in  Fig.  9.  Angle  A and  angle  P are  congruent, 
because  they  are  corresponding  angles,  so  A — 30°.  Angles  A and  B are  congruent 
for  the  same  reason,  so  B - 30  . Angles  B and  Q are  supplementary  because 
together  they  form  a straight  angle;  thus,  angle  Q is  shown  to  have  measure  Q = 
180° -30°  = 150°. 

Knowledge  for  Solving  Problems 

I will  now  present  hypotheses  about  the  knowledge  students  need  to  solve 
problems  in  geometry  of  angles  and  parallel  lines.  There  are  three  main  compo- 
nents of  the  theory.  First,  there  is  a representation  of  students’  ability  to 
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recognize  relations  between  angles  based  on  their  locations  relative  to  each  other 
and  to  parallel  lines.  A second  component  represents  students’  knowledge  of 
propositions  such  as,  “corresponding  angles  are  congruent.”  The  third  compo- 
nent is  a mechanism  that  sets  goals  and  selects  components  of  the  knowledge 
structure  that  are  needed  in  solving  specific  problems. 


Recognition  of  relations.  To  solve  a problem  of  the  kind  shown  in  Fig.  8 one 
requirement  is  that  students  learn  to  identify  relevant  relations  between  pairs  of 
angles.  A standard  exercise  involves  presentation  of  a diagram  like  Fig.  10  with 
instructions  to  “Find  four  pairs  of  corresponding  angles,  eight  pairs  of  alternate 
angles,  and  four  pairs  of  verticle  angles.”  The  performance  required  of  students 
is  that  they  be  able  to  identify  certain  patterns  of  relational  properties  The 
relevant  psychological  theory  is  the  theory  of  pattern  recognition. 

In  current  theories  of  pattern  recognition  it  is  assumed  that  recognition 

r'SV,  '<!entifying  a Iearned  pattern  of  features  (Feigenbaum,  1963;  Hunt, 
1962;  Selfndge,  1959).  The  knowledge  required  to  recognize  patterns  is  a 
network  of  feature  detectors,  linked  together  in  an  appropriate  way.  Figure  1 1 


Itxlt:  SUCCEED  | 


FIG.  11  EPAM  net  for  identifying  relation!  between  pain  of  angles. 
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shows  part  of  a network  for  identifying  relations  between  angles.  The  relations 
shown  are  alternate  angles,  interior  angles  (on  the  same  side  of  the  transversal), 
and  corresponding  angles.  These  are  relations  between  angles  with  different 
vertices,  having  two  sides  that  are  parallel.  The  notation  used  refers  to  sides  of 
angle  x as  sx  and  tx , and  the  vertex  of  angle  x a.'  v* , and  similarly  for  angle  y. 
Note  that  at  the  top  of  the  diagram  there  is  a test  to  determine  whether  the 
angles  being  tested  have  a single  vertex.  A positive  outcome  here  would  send  the 
system  off  to  components  not  shown  here,  where  vertical  angles  and  adjacent 
complementary  or  supplementary  angles  would  be  identified. 

The  system  shown  in  Fig.  11  has  the  form  of  an  EPAM  net  (Feigenbaum, 
1963).  To  illustrate  the  recognition  system,  follow  the  tests  that  would  occur  for 
a pair  of  corresponding  angles,  such  as  D and  //in  Fig.  10.  First,  the  angles  have 
different  vertices.  They  do  have  a pair  of  sides  that  are  parallel,  so  par(sx  fiy  ) is 
positive.  Their  remaining  sides  are  not  a single  segment,  so  ident(fx,  ty)  is 
negative.  However,  the  side  of  angle  H is  a straight-line  extension  of  the  side  of 
angle  D,  so  extd(tx>  ty)  is  positive.  The  angles  are  both  above  their  respective 
parallel  sides,  so  same(x,  y;  sx  / sy)  is  positive,  and  both  are  on  the  left  side  of 
the  transversal,  so  same  [x,  y;  concat(f*,  T>,)]  is  positive.  The  system  thus  arrives 
at  corr(x,  y ) and  exits  with  success. 

In  EPAM,  feature  tests  are  carried  out  serially,  in  a fixed  order.  I will  present 
some  considerations  shortly  that  question  this  aspect  of  the  model,  and  1 do  not 
consider  that  a critical  feature  of  the  theory.  The  important  psychological  idea  is 
that  a system  for  recognizing  patterns  is  a network  of  feature  tests,  and  students 
must  acquire  such  a network  as  part  of  their  knowledge  of  geometry. 

Network  of  propositions.  Students  also  learn  numerous  propositions  involv- 
ing relations  among  angles.  For  example,  “corresponding  angles  are  congruent,” 
and  “adjacent  angles  that  form  a straight  angle  are  supplementary.”  A set  of 
propositions  in  memory  is  commonly  represented  as  a network  in  which  nodes 
represent  concepts  and  links  represent  relations  among  the  concepts  (Anderson 
& Bower,  1973;  Kintsch,  1974;  Norman  etal,  1975).  Several  propositions  from 
geometry  are  shown  in  Fig.  1 2. 

First,  consider  connections  in  the  network  where  nodes  are  linked  by  dashed 
lines.  An  example  is  (VERT  X Y)  -*•  (CONG  X '/),  vertical  angles  are  congruent. 
Sometimes  there  are  three  properties  involved,  as  vhere  (RT  X)  and  (RT  T)  and 
(CONG  X Y)  are  all  joined;  the  proposition  is  that  if  X and  Y are  both  right 
angles,  then  X and  Y are  congruent. 

The  dashed  arrows  in  Fig.  12  correspond  to  inferences  that  can  be  made  or 
conclusions  that  can  be  taken.  For  example,  the  student  can  conclude  that  X 
and  Y are  congruent  if  it  is  known  that  X and  Y are  vertical  angles.  The 
propositions  shown  in  Fig.  12  thus  correspond  to  productions  in  the  sense  of 
Newell  and  Simon  (1972),  for  each  proposition  has  a condition  and  an  action 
component.  The  condition  is  given  at  the  tail  (or  tails)  of  an  arrow,  and  the 
action  is  given  at  the  head . 


A 
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FIG  12  Network  of  propositions  corresponding  to  productions  used  in  solving  problems 

anOUt  9n0m<  anH  nertllal  ‘ 


The  representation  in  Fig.  12  simplifies  the  situation  in  an  important  way  The 
nodes  there  represent  states  of  affairs  that  correspond  to  propositions.  It  is 
useful  in  considering  problem  solving  to  unpack  those  propositions  and  represent 
them  as  relational  structures.  This  is  done  in  Fig.  13.  There,  dashed  lines  still 
represent  inferences  that  can  be  made,  but  they  are  inferences  that  derive  a 
proposition  from  other  propositions.  Each  proposition  consists  of  one  or  more 
lements  (frequently  angles)  and  a property  or  a relation  involving  the  ele- 
ment(s).  Figure  13  also  represents  several  propositions  about  the  measures  of 
angles.  For  example,  in  upper  right  comer  there  is  the  proposition,  if  A"  and  Y 
are  complementary  angles,  then  the  measure  of  X plus  the  measure  of  Y is  90°. 
In  the  upper  left  comer  are  represented  some  complicated  but  important 
propositions  about  the  measures  of  concatenated  angles;  for  example,  if  angles 
A,  B, ....  K are  concatenated  to  form  angle  X,  then  the  measure  of  Af  is  equal 

to  the  sum  of  the  measures  of  A,  B and  K.  Note  that  there  is  no  specific 

notation  to  distinguish  between  the  angle  X and  the  measure  of  angle  X.  When 

“ el,ernent  throu«h  311  arithmetic  relation  (=,  +,  or  -)  it  is  understood  that 
the  element  is  the  measure  of  an  angle. 

Now  return  to  Fig.  9,  and  consider  the  network  that  represents  the  relations 
among  angles  in  the  diagram  used  in  obtaining  the  solution.  Initially  the 
information  is  given  that  P = 30°,  and  the  problem  is  to  find  the  measure  of  Q. 
The  solution  is  obtained  as  follows:  First,  angle  A is  noticed,  and  since  A and  P 
are  corresponding  angles,  the  measure  30°  is  assigned  to  A Next,  angle  B is 


4 
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noticed,  and  since  A and  B are  corresponding  angles,  the  measure  30°  is  assigned 
to  angle  B.  Finally,  since  B and  Q form  a straight  angle,  the  measure  of  Q is  180° 
- 30  =150.  Note  that  each  of  these  inferences  is  represented  in  the  diagram  by 
a dashed  line.  The  solution  of  the  problem  is.  shown  as  a connected  relational 
network  that  satisfies  the  requirement  of  connecting  Q with  a quantity-that  is 
assigning  a measure  to  Q.  Each  step  in  solving  the  problem  corresponds  to  a 
proposition  found  in  the  network  of  propositions  shown  in  Fig.  1 3. 

Problem-solving  procedures.  Knowledge  structures  like  those  represented  in 
Fig.  1 1 and  Fig.  13  are  necessary  for  a student  to  solve  geometry,  but  they  are 
not  sufficient  An  additional  requirement  is  a system  for  interpreting  a problem, 
setting  goals,  and  selecting  productions  from  the  knowledge  base  for  use  in 
generating  the  relations  needed  for  solution  of  the  problem.  The  ideas  to  be 
presented  here  are  an  attempt  to  use  intuitions  about  problem  solving  that  have 
been  recognized  for  many  yean,  especially  by  Duncker  (1945),  Selz  (1913),  and 
Wertheimer  (1959).  When  a problem  is  undentood,  the  person  perceives  certain 
structural  relations  among  components  of  the  problem.  However,  the  structural 
pattern  is  not  complete,  and  that  is  why  there  is  a problem.  Thus,  problem 
solving  can  be  seen  as  a process  of  modifying  a structure  in  order  to  complete  a 
pattern.  Recent  contributions  to  the  theory  of  problem  solving  have  developed 
formal  representations  of  goal-directed  pattern  matching  (Hewitt,  1969;  Wino- 
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grad,  1972)  and  aearch  for  operators  or  productions  that  achieve  progress  toward 
soluton  of  a problem  (Ernst  3l  Newell,  1969;  Newell,  1972b;  Newell  & Simon, 
1972). 

Problems  in  the  geometry  of  parallel  lines  and  angles  can  be  solved  by  a system 
that  can  satisfy  goals  consisting  of  patterns  that  may  be  matched  in  the  problem 
situation.  The  system  keeps  a list  of  angles  whose  measures  are  known,  and  a list 
of  relations  between  pairs  of  angles  that  have  been  found  during  the  process  of 
problem  solving.  The  system  knows  about  quantitative  relations  such  as  (CONG 
X Y),  X and  Y are  congruent,  and  (SUPP  X Y),  X and  T are  supplementary.  It 
also  knows  about  geometric  relations  such  as  (CORR  X Y)  and  (VERT  X Y), 
which  it  can  evaluate  using  feature  tests  such  as  those  in  Fig.  11,  and  it  knows 
which  quantitative  relation  can  be  inferred  from  each  geometric  relation,  as 
shown  in  Fig.  12. 

A problem  is  presented  in  '.he  form  of  a diagram,  a goal,  and  some  given 
information.  The  system  assimilates  the  given  information  and  sets  the  goal  as 
presented.  For  the  problem  shown  in  Fig.  8,  the  structures  arc 

(MEAS  P 30)  (GOAL  * (ME AS  Q ?NUM)) 

(My  notation  here  is  a mixture  of  Newell’s  (1972b)  notation  for  a production 
system  oriented  toward  problem  solving,  and  the  simplified  PLANNER  syntax 
used  by  Winograd  (1973).  The  asterisk  marks  the  current  goal  of  the  system,  and 
is  replaced  by  % if  the  goal  is  set  aside  temporarily  while  another  goal  is 
attempted.  A pattern  such  as  (MEAS  Q ?NUM)  specifics  a property  or  relation 
first,  then  the  objects  that  have  the  property  or  relation.  A question  mark 
indicates  a gap  in  the  pattern,  and  the  goal  is  to  find  some  object  that  satisfies 
the  gap.  For  example,  ?NUM  indicates  that  the  gap  is  to  be  tilled  by  a number.) 

The  system  works  by  evaluating  its  current  goal.  There  are  several  kinds  of 
goal,  each  corresponding  to  a procedure.  The  procedure  succeeds  if  certain 
specified  conditions  are  found  in  the  data  structures  containing  list  of  known 
measures  and  relations  on  angles.  If  a goal  succeeds,  the  system  carries  out  an 
action  called  ASSIGN  which  adds  an  appropriate  entry  to  the  data  structure  and 
deletes  the  accomplished  goal  from  the  goal  structure.  If  the  goal  fails,  a new  goal 
is  created  and  the  old  goal  is  saved.  The  system  tries  to  accomplish  specific  goals 
first,  then  retreats  to  weaker  goals  that  can  produce  results  of  possible  use  to  the 
stronger  goals  that  have  failed  earlier. 

As  an  example,  when  (GOAL  * (MEAS  Q ?NUM))  is  evaluated,  each  angle 
with  known  measure  is  examined  to  see  whether  it  is  part  of  a structure  in  which 
a quantitative  relation  links  the  known  angle  with  angle  Q.  The  procedure  used  i< 
a variant  of  the  MATCH  process  used  in  HAM  (Anderson  & Bower,  1973).  If 
such  a structure  were  found,  the  procedure  would  return  a structure  of  the  form 
(QR  Q NUM)  Where  QR  is  the  quantitative  relation  found  (CONG,  COMP, 
SUPP,  or  CIR),  and  NUM  is  the  numerical  value  of  the  known  angle  found  in  the 
structure.  If  ute  goal  succeeded,  the  system  would  <urry  out  the  action  ASSIGN- 
MEAS,  creating  a data  structure  in  which  Q would  be  assigned  a measure  equal 
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to  NUM  if  the  relation  found  were  CONG,  or  90  - NUM  if  QR  were  COMP  or 
180  - NUM  if  QR  were  SUPP,  or  360  - NUM  if  QR  were  CIR.  In  solving  the 
problem  of  Fig.  8,  the  goal  of  assigning  measure  fails  initially,  so  the  system 
takes  the  action  (SETGOAL  (?QREL  Q ?ANGM). 

Now  the  goal  structure  is  the  following  list: 

((GOAL  * (?QREL  Q ?ANGM)XGOAL  % (MEAS  Q ?NUM))> 

Again,  the  system  examines  the  angles  with  known  measure,  hoping  to  create  a 
structure  in  which  a quantitative  relation  links  angle  Q with  a known  angle.  This 
goal  succeeds  if  there  is  a data  structure  in  which  Q is  linked  to  a known  angle 
through  one  of  the  geometric  relations  (CORR,  VERT,  ST, ... ) from  which  a 
quantitative  relation  can  be  inferred.  Again,  no  such  relation  is  found,  so  the 
system  tries  to  create  one,  setting  up 

(GOAL  * (?REL  Q ?ANGM». 

This  goal  examines  the  angles  with  known  measure,  testing  the  feature  < cf  each 
one  in  relation  to  Q using  the  recognition  network  shown  in  Fig.  1 ! . Uus  fails, 
since  P,  the  only  known  angle,  has  no  side  that  is  either  identical  to  a side  of  O 
or  a straight-line  extension  of  a side  of  Q. 

The  system  has  failed  in  all  its  attempts  to  directly  link  Q with  an  angle  that 
has  known  measure.  It  then  retreats  to  the  following  goal: 

(GOAL  • ((MEAS  ?ANG  ?NUM)(NEARER  Q 9ANG  YANGM))). 

This  goal  tries  to  assign  measures  to  some  angle  that  can  be  *V  r 1 in  the  dia- 
gram that  is  nearer  angle  Q than  some  angle  whose  measure  is  a!  xnown.  The 
property  NEARER  is  defined  on  a path  consisting  of  tram  sals  between 
parallel  lines.  The  procedure  takes  P (still  the  only  known  angle)  and  works 
through  its  list  of  geometric  relations  until  it  finds  one  that  it  can  pair  with  P 
and  match  the  features  of  the  pair  with  the  pattern  needed  to  identify  the 
relation.  In  the  diagram  of  Fig.  8,  all  the  geometric  relations  can  be  found  for 
angles  paired  with  P;  presumably  the  one  that  is  found  is  the  one  at  the  top  of 
the  system’s  list  of  relations.  (Note  that  trying  to  find  an  angle  fitting  a specific 
relation  involves  activating  a terminal  node  in  the  recognition  network  and 
testing  the  pattern  of  features  connected  to  that  node,  rather  than  working 
own  the  network  from  the  top.  The  process  is  again  analogous  to  the  MATCH 
Pro““in  HAM  (Anderson  & Bower,  1973).  in  this  case  the  probe  received  by 
MATCH  is  the  angle  P and  a terminal  node  that  names  a relation.) 

The  goal  finds  angle  A and  identifies  the  relation  (CORR  PA),  then  returns 
e structure  (CONG  A 30).  This  leads  to  assigning  measure  30  to  angle  A in  the 
list  of  known  angles.  Now  the  system  returns  to  its  previous  goal  (?REL 
’ANGM).  With  a new  known  angle  is  the  list,  a geometric  relation  with  Q might 
be  found,  but  it  is  not.  (Note  that  all  tne  features  of  corresponding  angles  arc 
present  except  the  last  one  requiring  that  the  angles  be  on  the  same  side  of  the 
transversal.)  This  causes  the  system  to  recreate  the  goal  ((MEAS  ?ANG  ?NU- 
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“Kf fcoJr  WhiCh  “C“'<h  ta)"*h  (CORRA  B) 

d returns  (CONG  B 30);  then  the  system  assigns  measure  30  to  angle  B 

Now  the  system  returns  to  its  goal  (?REL  Q ?ANGM)  again,  and  this  time  it 

in  s a own  angle  related  to  Q;  the  goal  now  returns  (ST  Q B)  and  this 

£2?  “ the  Ust  of  kBOWn 

it  fu  ,ANGM  1S  reactlvated  and  it  succeeds,  returning  (SUPP  Q B) 

succeedVtheS^isT™  t0  **  8°al  (MEAS  Q ?NUM>  and  this 

succeeds,  the  last  act  is  to  assign  measure  150  to  angle  Q. 

The  preceding  illustrates  the  process  of  problem  solving.  1 will  now  present  a 

nighest  level  the  procedure  is  a production  system  that  takes  a goal  as  a 
conchtion  and  evaluates  the  goal  as  an  action.  Then  the  outcome  of  evaluating 
goal  becomes  part  of  the  condition,  and  an  action  of  creating  a datf 

“l,'Sr  7! if  ""  ",CC"d'd;  * ”•  » created!  At 

thJ  level,  the  model  represents  general  skills  involved  in  problem  solving 
including  such  strategies  as  trying  to  find  a direct  link  between  data  Td 

UnTheWorocT  f ^ W°rking  °n  somethin8  more  complicated. 

procedures  for  evaluatmg  goals  incorporate  knowledge  about  the  rela 
tional  properties  and  propositions  involved  in  the  task  domain  Each  evaluation 
procedure  looks  at  data  structures  that  represent  relations  among  components  of 
die  problem  and  determines  whether  other  needed  relations  can  be  in  fe  reed 

iUir?  h h 3 pr°f Cti0n  WJtem  functionin8  at  the  level  of  specific 
inferences  made  during  the  process  of  solving  a problem.  As  an  example  the 

(SgNOREU^8  ^ tg°d,(?QREL  X ? ANGM>  »d  the  procedure 
(ASMGNQREL)  is  equivalent  to  the  following  set  of  productions: 

<G0AL  'KS  n ' (OOAI-*(MEAS,'?NUM» 

(ST  X Y)  -*•  (SUPP  x Y). 

(CIRCLE  X Y)  -*•  (CIR  X Y) 

(VERT  X T)  (CONG  X Y) 

(CORR  X Y)  -*  (CONG  * Y) 

(ALT If)  •+  (CONG X Y) 

(INTSAM  X Y)  -►(SUPP  X Y). 

That  is,  when  the  goa!  structure  is  as  shown  in  the  first  line,  and  tiie  data 
s ructure  contains  the  element  on  the  left  of  one  of  the  other  lines,  then  the 
element  on  the  nght  of  that  line  can  be  created. 


it 


7 ' 


Pattern  Recognition  and  Constructions 

In  many  geornetry  problems,  the  material  presented  does  not  permit  a solution- 
the  problem  solver  must  supply  additional  lines.  An  example  of  such  problems 
in  Fig.  14,  given  that  AB  is  parallel  to  CD,  find  an  equation  connecting*,  Y, 
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FIG.  14  Diagram  for  a problem 
requiring  a construction. 


and  Z.  One  solution  is  obtained  by  constructing  a parallel  to  AB  through  the 
vertex  of  Y,  creating  angles  Yx  and  Yr . Then  (ALT  X Y , ) -*■  X = Y , and  (ALT 
Z Y2)-*  Z=  Tj , giving  Y i + Y2  = X + Z,  and  then  (CONCAT  (K i Yt)Y)-*Y- 
Yi  + Y^soY^X  + Z. 

The  interesting  psychological  question  is  how  the  problem  solver  thinks  of 
making  the  construction.  One  way  for  this  to  happen  would  be  to  arrive  at  the 
goal  of  finding  a relation  between  X and  Y,  test  the  features  of  some  relation, 
and  find  a partial  match.  In  fact,  one  subject  solving  this  problem  said,  “If  there 
were  a parallel  line  here . then  X and  Y would  be  equal." 

In  order  to  solve  problems  requiring  constructions,  the  problem-solving  system 
should  be  able  to  detect  partial  patterns,  and  should  have  productions  for 
completing  patterns  by  adding  new  points  and  lines  in  the  problem.  Recognition 
of  the  need  for  a construction  is  similar  to  the  understanding  of  a problem-it 
involves  matching  components  in  the  problem  with  a stored  problem  and  finding 
a gap  or  a partial  mismatch.  The  idea  sketched  here  of  constructions  related  to 
subgoals  and  pattern  recognition  is  quite  similar  to  Gelemter’s  (1963)  treatment 
of  the  problem,  where  constructions  are  developed  when  subgoals  have  failed, 
and  a frequent  cause  of  failure  har  been  the  absence  of  a feature  that  can  be  put 
into  the  problem  with  an  available  construction  theorem.  The  present  discussion 
has  considered  only  the  process  of  recognizing  that  a construction  would  be 
useful;  actual  mechanisms  for  making  constructions  have  been  described  by 
Scandura,  Durnin,  and  Wulfeck  (1974). 

Meaningful  Solution  Structures 

Gestalt  psychologists  such  as  Duncker  (1945)  and  Wertheimer  (1959)  empha- 
sized the  desirability  of  teaching  students  meaningful  solutions  of  problems, 
rather  than  rote,  mechanical  forms  of  solution.  The  concept  of  meaningfulness 
in  problem  solving  has  generally  depended  on  the  intuitions  of  authors  and 
readers.  Perhaps  some  progress  can  be  made  toward  pinning  the  concept  down 
by  examining  the  relational  networks  that  represent  alternative  solutions  to 

problems. 
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FIG.  15  Diagram  for  the  pro- 
blem of  vertical  angles. 


i WUJ  discuss  one  of  the  problems  that  Wertheimer  (1959)  considered  the 
problem  of  vertical  angles.  1 will  present  analyses  of  two  solu  Ztat  We r 
thetmer  presented  to  illustrate  his  distinction  between  mearSnZ  Z Ze 

compared>  5nd  two  ***  ^ 

Cflr!,I5rrn0,!,i0n  f°r  ^ Probte^  of  vertical  angles.  Given  that  AB  and 
CD  are  straight  lines  that  intersect  at  0,  prove  that  * and  Y are  congruent 

A typical  statement  of  the  proof  goes  as  follows: 

1.  X + £A0C=  180°,  since  they  form  a straight  angle. 

2.  K + L40C=  J 80  , for  the  same  reason  as  (1). 

3.  X + LA0C=Y+LA0C,  since  they  both  equal  180°. 

J.  X * Y,  since  UOC  can  be  subtracted  from  both  sides  of  (3). 

A graph  showing  the  relations  in  this  proof  is  shown  in  Fig  16 

M 1“d  ,hiS  P'°°f  " b'inS  His  eeitknc. 

whin  lsS  o ,TP  T"  ,cMons  ““d'd  the  observation  that 
iiOD-ajld  then  become  pSed  ’ °f'“  * * UK  * ,80”'  r + 

”ms  ,o  r“  wer,heira",!  f— **« 

5 a »e  both  straight  angles. 

2.  X and  Z^IOC  form  LDOC by  concatenation. 


FIG.  16  Diagram  of  a solution 
of  the  problem  of  vertical  angles 
primarily  using  algebraic  relations. 
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3.  Y and  LAOC  form  LAOB  by  concatenation. 

4.  X and  Y are  congruent,  because  they  form  congruent  angles  when  they  are 
concatenated  with  the  same  third  angle. 

A diagram  showing  this  proof  is  shown  in  Fig.  17. 

There  are  two  apparent  differences  between  Fig.  16  and  Fig.  17.  First,  Fig  17 
is  slightly  simpler  than  Fig.  16.  Second,  Fig.  17  uses  only  geometric  relations  and 
properties,  while  most  of  the  relations  in  Fig.  16  are  algebraic.  It  seems 
reasonable  to  suppose  that  both  of  these  properties  relate  to  meaningfulness  of  a 
solution. 

One  sense  of  the  concept  of  meaningfulness  involves  coherent  structure.  We 
would  say  that  a student  has  better  understanding  if  all  components  of  a 
problem  are  linked  closely  with  many  other  components,  rather  than  each 
element  being  connected  with  only  one  or  two  other  components.  In  general, 
closer  linking  will  correspond  to  simpler  structure.  In  Fig.  17,  the  congruence  of 
X and  Y is  derived  in  one  step  from  the  congruence  of  LDOC  and  LAOB, 
combined  with  the  concatenations  involving  X and  Y with  LAOC.  In  Fig.  16,  the 
route  to  X = Y is  slightly  more  circuitous,  involving  equality  of  two  quantities 
because  they  both  equal  180°,  and  an  algebraic  operation  on  the  expressions  A' + 
LAOC  and  Y + LAOC. 

It  should  be  noted  that  the  simplicity  of  a certain  kind  of  solution  is  well 
defined  only  with  respect  to  a fixed  set  of  productions.  A simpler  structure  than 
Fig.  16  would  apply  to  a subject  who  had  a production 

(X  + A =M)  and  (Y+A=M)-+  (X  = Y). 

It  seems  to  agree  with  intuition  that  a student  with  a richer  set  of  complex 
productions  would  have  a better  understanding  of  problems  than  a student  who 
had  to  work  out  many  sequences  of  small  steps.  On  the  other  hand,  this  shows 
that  the  question  of  meaningfulness  cannot  be  decided  on  grounds  of  simplicity 


PIG.  17  Diagram  of  a solution  of 
the  problem  of  vertical  angles 
using  geometric  relations. 
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in  favor  of  one  kind  of  solution  rather  than  another,  since  either  can  probably 
haw  a complex  or  a simple  version,  depending  on  the  complexity  of  a person’s 
knowledge. 

The  second  feature  distinguishing  Fifi.  16  and  17  is  the  extent  to  which  they 
use  geometric  relations,  rather  than  a *ebraic  operations.  I think  that  this  may 
have  been  what  Wertheimer  had  in  mi  d in  referring  io  understanding  structural 
relations  in  this  problem,  rather  than  applying  an  algorithm  in  a way  that  often 
might  seem  arbitrary  in  the  sense  of  lacking  motivation  in  the  domain  of  the 
problem. 

The  distinction  car.  be  made  rigorous  if  we  define  two  problem  spaces,  one 
having  productions  that  we  call  geometric,  the  other  having  productions  that  we 
call  algebraic.  A problem  in  geometry  can  be  solved  entirely  in  the  problem 
space  of  geometry,  if  sn  appropriate  set  of  productions  exi.,  s there  and  they  are 
found  and  applied.  Alternatively,  there  may  be  a mapping  of  some  geometric 
properties  and  relations  into  the  uomain  of  algebra.  These  could  be  translations 
of  properties,  or  they  could  involve  productions  that  take  geometric  properties 
as  conditions  and  create  algebraic  objects  as  actions.  When  objects  are  created 
that  can  satisfy  the  conditions  of  algebraic  productions,  then  problem  solving 
can  go  on  in  the  problem  space  of  algebra.  After  an  appropriate  set  of  produc- 
tions has  been  applied,  a translation  back  to  geometric  objects  can  be  carried 
out,  if  it  is  needed.  (Strictly  speaking,  the  solution  in  Fig.  16  is  incomplete.  A 
final  step  using  the  proposition,  “if  two  angles  have  equal  measure,  they  are 
congruent,”  would  finish  the  job.) 

The  distinction  between  solving  a problem  in  its  own  problem  space  and 
translating  into  another  for  purposes  of  computation  probably  is  subject  to 
considerable  blurring,  especially  if  we  consider  the  result  of  experience  in 
applying  productions  from  one  domain  to  solve  problems  in  another.  It  seems 
likely  that  any  pair  of  productions  of  the  form  A -*•  B.  B -*•  C,  if  used  often 
enough,  would  soon  lead  to  the  existence  of  a production  A -*•  C.  By  a similar 
process  of  fusion,  it  seems  likely  that  a student  who  has  applied  algeb.aic 
operations  many  times  to  geometric  quantities  (spatial  representations  of  angles, 
areas,  and  so  on)  would  probably  have  what  amounted  to  a set  of  productions 
for  manipulating  geometric  quantities,  without  explicit  translation  into  algebraic 
operations.  Clearly,  the  question  of  meaningfulness  of  a problem  solution  is 
relative  to  the  specific  set  of  productions  that  a problem  solver  has  available, 
whether  we  consider  meaningfulness  as  solving  in  the  problem  domain  or  as 
producing  as  a solution  a well-integrated  relational  structure. 

I have  some  anecdotal  evidence  that  achievement  of  the  apparently  more 
meaningful  solution  in  fact  depends  on  the  student’s  having  a general  production 
for  manipulating  quantities  of  the  form 

(CONCAT  (A  B)X)  and  (CONCAT  (A  C)  Y) 
and  (CONG  XY)^  (CONG  B Q. 
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One  subject  with  whom  1 have  worked  on  this  problem  did  not  know  how  to 
prove  that  vertical  angles  are  congruent  when  we  began.  I gave  the  steps  of  a 
proof  stating  the  two  sums  that  equal  180°,  making  a single  equation,  and 
obtaining  the  equality  by  subtracting  the  same  quantity  frc.m  both  sides  of  the 
equation.  About  two  weeks  later,  I asked  the  subject  whether  she  remembered 
the  proof;  she  did  not.  Then  I gave  some  different  examples  involving  concatena- 
tion of  quantities.  One  example  involved  weighing  suitcases  by  holding  them  and 
standing  on  a scale.  If  two  suitcases  pioduce  the  same  weight  when  they  are 
combined  with  a person,  the  suitcases  must  be  of  equal  weight.  The  other 
example  involved  distances  from  dty  to  city  given  on  a map.  If  the  distance 
from  Liverpool  to  London  via  Birmingham  equals  the  distance  from  Birmingham 
to  Dover  via  London,  then  the  distance  from  Liverpool  to  Birmingham  must 
equal  the  distance  rom  London  to  Dover  With  these  items  of  background,  the 
subject  generated  the  proof  of  equal  vertical  angles.  Then  she  solved  Wer- 
theimer’s transfer  problem  where  the  angles  shown  are  overlapping  right  angles, 
and  she  remembered  the  proof  about  vertical  angles  on  two  later  occasions— one 
two  days  later  and  tlie  othei  seven  months  later. 

This  anecdote  does  not  provide  sufficient  evidence  for  any  definite  conclusion 
about  exact  structural  relations  in  the  problem  of  vertical  angles.  It  docs 
illustrate  a use  of  the  theoretical  analysis  in  identifying  the  cognitive  component 
needed  to  solve  special  problems.  In  this  case,  if  my  analysis  is  correct,  the 
needed  component  is  a production  dealing  with  combinations  of  quantity  in  a 
general  way,  rather  than  with  specific  geometric  concepts. 


Conclusion 

Geometry  represents  at  least  two  levels  of  knowledge  that  arc  more  complex 
than  ’■re  involved  in  the  simple  kinds  of  computation  involved  in  elementary 
fractions.  The  recognition  network  needed  to  identity  relations  between  angles 
involves  a concept  in  the  form  of  a procedure  for  processing  stimulus  features. 
That  seems  no  more  complex  than  the  procedures  for  finding  equivalent  frac- 
tions and  other  similar  operations  in  elementary  arithmetic.  However,  the 
inferences  needed  to  solve  problems  require  a network  of  propositional  knowl- 
edge corresponding  to  productions  that  take  properties  and  relations  as  condi- 
tions and  generate  new  relations  as  actions.  And  the  system  requires  general 
knowledge  of  relations  between  goals,  to  select  propositions  in  a way  that  will 
lead  to  solutions  to  problems  that  are  presented. 

The  general  analysis  of  problem  solving  as  recognition  of  partial  patterns 
provides  a framework  for  analyzing  the  process  of  recognizing  the  need  for  a 
construction  in  a geometric  problem.  The  framework  also  provides  a way  of 
comparing  solutions  of  problems  that  partially  clarifies  the  troublesome  concept 
of  meaningfulness  in  problem  solving. 
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EXAMPLE  3:  AUDITORY  PSYCHOPHYSICS 

The  tasks  to  be  considered  in  this  third  section  require  yet  another  increase  in 
the  complexity  of  performance  by  a student.  Nearly  all  problems  given  in  high 
school  mathematics  are  well-defined  problems  in  the  sense  that  they  present  a 
specific  goal  and  a specific  set  of  premises  or  data  to  work  from.  In  many 
situations  students  are  asked  to  produce  paragraphs  or  brief  essays  as  answers  to 
questions.  While  many  questions  require  only  simple  retrieval  of  factual  informa- 
tion from  memory,  others  present  ill-defined  problems  (Reitman,  1965)  in 
which  the  student  must  generate  more  than  a path  of  operations  leading  to  a 
goal. 

The  material  that  I will  consider  in  this  last  section  is  part  of  the  content  of 
introductory  college  psychology,  and  the  selection  of  content  for  this  discussion 
is  taken  from  the  introductory  text  by  Lindsay  and  Norman  (1972).  I do  not 
intend  to  suggest  that  the  kinds  of  complex  semantic  processing  that  I will 
discuss  here  are  confined  to  college-age  adults.  The  processes  are  required 
whenever  a person  generates  complex  substantive  material,  as  in  the  telling  of 
stories,  and  much  of  that  is  done  by  quite  young  children.  Certainly,  complex 
question  answering  is  expected  of  students  in  junior  high  school  and  high  school 
in  many  of  their  courses. 

Semantic  Networks 

A structure  of  concepts  and  relations  can  be  represented  conveniently  as  a 
network,  and  a majority  of  investigators  use  such  a representatior  to  charac- 
terize the  knowledge  required  for  answering  questions  (see  especially  Anderson 
& Bower,  1973;  Kintsch,  1974;  Norman  et  al.,  1975).  The  notation  that  I will 
use  here  is  similar  to  that  of  Norman  et  al.  (1975),  which  also  is  included  in 
Li:  dsay  and  Norman’s  (1972)  text  and  is  used  in  Norman’s  chapter  in  this 
volume.  I have,  however,  reversed  the  roles  of  elements  and  relations,  partly  to 
be  consistent  with  my  earlier  discussion  and  partly  because  some  of  the  discus- 
sion of  this  material  is  helped  by  having  attention  focussed  on  relations  rather 
than  on  concepts  as  the  main  components  of  the  structure. 

The  content  of  psychophysics  includes  concepts  fron  physics,  biology,  and 
psychology.  Most  of  the  information  given  by  Lindsay  tnd  Norman  about  the 
physics  of  sound  is  shown  in  Fig.  18.  Most  of  the  relations  shown  such  as  ISA, 
HAS,  and  CAUSE  have  been  used  frequently  in  many  discussions  of  semantic 
memory.  ISA  denotes  category  membership.  HAS  denotes  a relation  of  property 
attribution,  which  takes  several  foims  not  distinguished  here  such  as  having  parts 
(components  of  a complex  wave)  and  having  uni*s  (such  as  Hertz  for  frequency). 

The  representation  of  Fig.  18  is,  of  course,  highh  schematic.  A complete 
description  would  include  many  distinctions  not  made  in  the  diagram,  and 
would  require  elucidation  of  several  components.  Tn  general,  the  concepts  and 
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relations  correspond  to  schemata  that  can  be  unpacked  if  necessary  either  by  a 
theorist  for  more  detailed  analysis  or  (in  a different  sense)  by  the  subject  when 
necessary  in  considering  specific  aspects  of  a topic  or  question.  An  example  is 
the  concept  of  a function,  which  is  a general  schema  involving  a relation  of 
corrjttpondenM  between  members  of  two  or  more  sets.  Thus  the  node  RE- 
LATES and  the  presence  of  two  variables  whose  values  are  connected  by  the 

3Sf  533115  °f  the  schema  “Seated  here  by  the  single  term 
function.  This  kind  of  conceptual  embedding  is  discussed  in  more  detail  by 
Norman , Centner,  and  Stevens  in  Chapter  9 of  this  volume . 

The  dashed  lines  and  diagrams  in  Fig.  18  involve  concepts  whose  understand- 
ing apparently  includes  production  of  a diagram  or  image.  The  hypothesis 
involved  here  is  similar  to  that  used  in  the  section  on  fractions,  where  I 


FIG.  19  Network  of  propositions  for  anatomy  of  hearing. 
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presented  the  idea  thit  some  processing  can  be  mediated  by  use  of  a representa- 
tion of  quantity  either  in  spatial  or  set-theoretic  te  ns.  The  use  of  generative 
processes  involving  images  for  question  tnswerir  i has  been  noted  before, 
notably  by  Jorgenson  and  Kintsch  (1973)  by  Norman  (1973).  Norman’s 
discussion  is  especially  pertinent.  His  example  involved  generating  a floor  plan, 
and  subjects  used  rules  based  on  general  properties  of  rooms  and  buildings,  even 
when  this  led  to  mistakes  in  the  specific  task  they  were  engaged  in. 

Just  as  many  of  the  nodes  in  Fig.  18  represent  concepts  that  are  not  spelled 
out  in  detail,  the  diagrams  presented  there  only  sketch  the  representation  that 
we  would  hope  students  acquire  regarding  sound  waves.  The  procedure  for 
generating  the  image  of  a sine  wave  should  be  connected  to  the  concepts  of 
frequency  and  amplitude  in  ways  that  I have  not  worked  out  in  detail,  and  the 
properties  of  the  various  diagrams  involving  concepts  of  vibrations,  a pressure 
wave,  and  a sinusoidal  function  all  should  be  related  to  each  other  in  definite 
ways  not  specified  here.  Analysis  of  these  cognitive  structures  would  be  a task 
well  within  the  technical  capabilities  available  at  present,  although  some  new 
understanding  would  probably  be  achieved  by  developing  detailed  models  here, 
as  with  other  domains. 

Figure  19  shows  a semantic  network  containing  the  main  concepts  of  anatomy 
connected  with  hearing.  This  figure  seems  entirely  straightforward;  however, 
note  that  HAS  has  yet  another  meaning  here  (the  cochlea  is  characterized  by 
some  of  its  properties,  and  by  some  of  its  parts),  and  the  relation  CONNECT  TO 
refers  to  rather  different  kinds  of  anatomical  relations  (the  way  in  which  the 
malleus  is  connected  to  the  incus  is  quite  different  from  the  way  hair  cells  are 
connected  to  neurons). 

Figure  20  shows  a network  of  concepts  that  refer  to  events  that  occur  when  a 
sound  wave  produces  a neural  reaction  in  the  brain.  The  descriptions  of  proper- 
ties of  neural  responses  of  different  kinds  are  severely  abbreviated  here,  and 
good  knowledge  of  these  would  involve  quite  an  intricate  structure  of  interrela- 
tionships. 

An  important  feature  of  Fig.  20  is  its  inclusion  of  concepts  from  both  Fig.  1 8 
and  Fig.  19.  The  knowledge  structure  that  we  expect  students  to  acquire  is  a 
synthesis  of  all  thr.e  of  these  networks.  It  should  be  noted  also  that  the 
knowledge  shown  in  Fig.  20  should  relate  strongly  to  the  student’s  general 
knowledge  .bout  neural  processes,  as  well  as  general  concepts  of  anatomy  and 
physics. 

Process  of  Answering  Questions 

Many  questions  can  be  answered  by  a relatively  simple  process  of  retrieving 
factual  information  that  is  stored  in  memory.  For  example.  Fig.  19  contains  the 
answer  to  the  question,  “which  bone  of  the  middle  ear  is  connected  to  the  ear 
drun?”  A process  for  retrieving  facts  from  memory  has  been  developed  in  detail 


154  JAMES  G.  GREENO 


ext'tatory 


inhibitor 


increase 


[cry 


kulft  ^ 


changing  frvquanc. 


[o: 


frrqutnc , 
(of  iOU^I 


Mount  of 

(local  activity 


«i0  bonding 


, _ (jl^ChnoflrJ 


Wf\  (of  sound  mvt 


Wwrom  intensity 

' yof  sound  wave) 

FIG.  20  Network  of  propositions  .boul  evenu  that  occur  when  a sound  is  heard. 

by  Anderson  and  Bower  (1973);  it  involves  entering  the  memory  structure  at 
opponents  mentioned  in  the  question  (in  this  case,  middle  ear,  bone,  and  ear 
drum)  and  searching  for  a match  to  a specified  structure.  If  a match  is  found  the 
information  is  retrieved  in  the  form  of  a propositional  structure  containing  the 
answer  required.  In  this  example,  the  matching  structure  would  contain  the  linlcs 
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(ISA  malleus  bone)  (CONNECT-TO  malleus  cochlea),  and  the  answer  would  be 
“the  malleus.” 

The  process  of  retrieving  facts  from  memory  can  be  seen  as  a kind  of  pattern 
matching;  in  fact,  the  mechanism  proposed  in  the  preceding  section  for  pattern 
matching  in  problem  solving  was  borrowed  from  Anderson  and  Bower  s(1973) 
discussion  of  fact  retrieval.  Some  information  resides  in  memory  as  a substruc- 
ture of  a person’s  knowledge.  A question  is  asked,  and  the  question  contains 
components  that  match  components  of  the  stored  information.  The  person 
retrieves  the  pattern,  including  components  that  were  not  in  the  question,  and 
the  new  retrieved  components  constitute  an  answer  to  the  question. 

Less  specific  questions  can  be  asked,  and  their  answers  require  some  selection 
and  judgment  by  the  person  who  answers.  “What  is  the  basilar  membrane? 
could  be  answered  (from  Fig.  19),  “a  membrane  in  the  cochlea,  connected  to 
hair  cells,”  or  (from  Fig.  20),  “the  thing  that  has  a travelling  bulge  caused  by 
pressure  waves  in  the  cochlea.”  or  a number  of  other  possibilities,  including  a 
combination  of  the  two  mentioned  above.  One  thing  a student  must  do  is  decide 
how  much  information  is  needed:  “tell  me  about  sound  waves,  can  call  for  a 
brief  paragraph  or  a 30-page  article  or  a book.  Also,  the  degree  of  specificity 
required  in  an  answer  sometimes  is  uncertain.  In  some  contexts,  the  question, 
“where  is  the  basilar  membrane?”  might  be  answered  best  with  “in  the  ear,  but 
in  other  contexts,  “in  the  cochlea”  might  be  more  appropriate  (cf.  Norman, 
1973).  Clearly,  there  are  some  important  principles  of  social  psychology  oferat- 
ing  in  the  answering  of  most  questions,  in  which  the  answerer  applies  assump- 
tions about  the  knowledge  structure  of  the  asker  in  deciding  what  kind  of 
information  is  most  relevant  to  the  question.  This  is  understood  well  by  students 
who  often  spend  some  - me  during  examinations  trying  to  judge  “what  the 
instructor  wants"  as  an  ar  wer  to  a question . 

One  das  of  questions  used  frequently  in  examinations  seems  to  raise  a special 
set  of  theoretical  questions.  These  «ie  questions  requiring  explanations  of 
phenomena  or  relationships.  A relatively  simple  example  is  the  question, 
“explain  how  pressure  waves  in  the  cochlea  produce  firing  of  neurons.  To  arrive 
at  an  answer,  a student  should  generate  a sequence  of  components  each  involving 
a CAUSE  relation.  A mechanism  like  the  one  described  for  finding  the  measure 
of  an  angle  in  Fig.  8 would  be  suitable.  The  process  might  start  by  setting  a goal 
of  finding  an  $vent  that  causes  firing  of  neurons,  then  checking  whether  that 
matched  the  “pressure  waves  in  the  cochlea”  taken  from  the  question,  and  if 
not,  searching  back  further  until  a path  of  causal  link*  had  been  established.  As 
in  the  case  of  problem  solving,  successful  performance  depends  both  on  having 
appropriate  knowledge  structures  and  having  an  appropriate  strategy  for  generat- 
ing goals  that  dive  the  pattern  matching  and  search  processes  needed  to  obtain 

an  answer.  , 

More  complicated  processes  are  needed  for  a question  such  as  the  following 
“A  recording  is  mai;  of  a performance  on  a pipe  organ.  When  the  recording  is 
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the  graph  was  generated.  In  this  case.  , uniform  decrease  in 
corresponds  to  a hoozcntal  line  moving  from  a high  level  on  the  graph  down- 
ward for  some  distance.  An  explanation  of  the  relative  softening  of  lcZrd  huh 
frequencies  would  note  that  . decrease  in  sound  level  passes  a greater  number  ff 
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FIG  21  Graph  representing  knowledge  about  the  way  in  which  loudness  depends  jointiy 
on  sound  intensity  and  frequency. 

contours  at  low  and  high  frequencies  than  at  medium  irequencies,  and  this 
corresponds  to  a greater  decrease  in  loudness  at  the  extremes  than  in  the  middle 
range. 


Conclusion 


In  this  section  I have  taken  up  issues  that  are  at  the  edge  of  available  theoretical 
concepts  in  cognitive  psychology.  The  theory  of  semap.ic  networks  is  quite  well 
developed  and  serves  to  represent  knowledge  structures  of  the  kind  we  try  to 
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teach  in  many  expository  subjects  such  as  psychology.  We  test  students’  knowl- 
edge by  asking  them  questions,  and  to  perform  successfully  they  are  required  to 
understand  the  questions  and  generate  appropriate  answers.  A substantial  begin- 
ning has  been  made  in  the  theory  of  language  comprehension,  and  some 
promising  suggestions  are  available  regarding  generative  processes  in  question 
answering.  The  available  theories  do  not  take  us  as  far  in  this  area  as  do  the 
available  theories  of  problem  solving,  bit  enough  is  understood  to  permit  a 
rough  sketch  of  the  kinds  of  processes  that  probably  are  involved. 


GENERAL  CONCLUSIONS 

My  goal  in  carrying  out  this  work  was  to  explore  the  applicability  of  current 
concepts  and  theories  in  cognitive  psychology  to  topics  actually  taught  in 
classroom  instruction.  The  results  have  been  encouraging.  I think  we  can  assert 
confidently  that  our  state  of  knowledge  and  understanding  in  cognitive  psychol- 
ogy has  now  developed  to  the  point  where  meaningful  contact  can  be  made  with 
the  content  of  instruction. 

i this  chapter,  the  concept  and  techniques  of  cognitive  psychology  have  been 
applied  to  analyses  of  inst;ur  tional  tasks.  Task  analysis  has  been  a major  activity 
of  educational  psycho!  jgists  for  some  time,  and  Resnick’s  chapter  3 in  this 
volume  provides  a review  and  discussion  of  instructional  task  analysis  in  relation 
to  cognitive  psychology. 

Careful  attention  to  components  of  instructional  tasks  is  potentially  helpful  in 
at  least  three  ways.  First,  it  aids  hi  the  design  and  evaluation  of  curriculum 
materials.  Secondly,  it  constitutes  useful  knowledge  for  teachers  who  have  the 
task  of  training  students  in  the  skills  and  understanding  that  are  represented  in 
the  theoretical  analyses.  Third,  it  probably  would  constitute  useful  information 
for  students  who  have  the  task  of  acquiring  the  skills  and  understanding 
represented  in  the  analyses. 

An  important  question  is  whether  task  analyses  that  are  more  strongly  em- 
bedded in  general  psychological  theory,  as  I have  attempted  to  embed  the 
illustrations  developed  in  this  chapter,  will  be  of  increased  usefulness  in  the 
practice  of  instruction.  It  would  be  pleasant  to  have  strong  reasons  for  a positive 
response  to  that  question,  but  it  seems  to  me  that  such  an  evaluation  must  come 
from  the  potential  users  of  the  product,  not  from  one  who  proposes  to  offer  the 
product  for  use.  A further  impediment  to  enthusiasm  now  is  the  fragmentary 
nature  of  the  illustrations  of  detailed  task  analysis  based  on  cognitive  theory.  A 
more  reasonable  evaluation  may  be  possible  when  we  can  display  a relatively 
complete  analysis  of  the  knowledge  desired  as  the  outcome  of  instruction  in 
some  subject  such  as  fractions  or  geometry  or  psychophysics.  Perhaps  the 
conclusion  can  be  drawn  from  the  present  work  that  it  is  reasonable  to  under- 
take such  an  analysis  using  concepts  that  are  currently  available  in  cognitive 
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psychology.  Of  course,  we  should  expect  that  an  effort  to  apply  current  theories 
will  show  some  needs  for  changing  the  theories.  But  we  have  appare1  .ly  reached 
a state  of  knowledge  and  understanding  that  provides  a reasonable  starting  basis 
for  development  of  instructional  objectives  based  on  j’eneral  psychological 
theory. 
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Impression  Formation, 
Discrepancy  from  Stereotype, 
and  Recognition  Memory1 

Ray  Hyman 

University  of  Oregon 


The  background  for  the  ch  in  this  chapter  includes  an  interest  in  the 
“prepared  mind”  (Hyman,  >%1,  1964a).  Under  what  conditions,  for  example, 
do  preconceptions  and  puo  knowledge  about  a problem  interfere  with  the 
restructuring  necessary  to  achieve  a solution  and  under  what  conditions  do  they 
help?  A particular  form  0/  this  question  is  concerned  with  the  role  of  dis- 
crepancy from  prototypes  in  guiding  the  course  of  inquiry  and  contributing  to 
the  growth  of  knowledge  (Brunswik,  1959;  Gombrich,  1961;  Hyman,  1964b; 
Kuhn,  1962;  Mischel,  1971). 

The  motivation  of  the  current  research  program  is  to  find  experimental 
paradigms  to  study  the  operations  of  schemata  and  discrepancies  frtm  proto- 
types in  science,  art,  and  everyday  affairs.  On  the  one  hand,  such  schemata 
contribute  to  distortions  of  reality,  missed  discoveries,  resistance  to  innovation, 
and  other  tendencies  to  assimilate  new  input  to  preconceived  viewpoints.  On  the 
other  hand,  they  are  necessary  precursors  to  the  recognition  of  important 
problems,  to  the  detection  of  anomalies,  and  to  their  own  eventual  adaptation  to 
the  disturbing  discrepancies. 

Although  the  experimental  paradigm  described  here  is  directed  primarily  to 
such  practical  issues,  both  the  paradigm  and  the  questions  it  was  designed  to 
answer  overlap  with  current  reseurch  in  psycholinguistics,  verbal  memory,  and 
cognitive  psychology.  This  overlap  might  be  taken  as  an  encouraging  sign  of  a 
narrowing  gap  between  issues  of  how  we  acquire  knowledge  in  the  real  world 
and  issues  being  studied  by  contemporary  cognitive  psychology. 

The  paradigm  is  based  on  the  impression-formation  task  used  in  studies  of 
person  perception  (Hastorf,  Schneide^,  & Polefka,  1970;  Warr  & Knapper, 


'The  experimenlal  data  were  collected  by  Janet  Polf  and  W.  Tram  Neill,  who  also 
contributed  both  to  the  design  and  the  analysis  of  the  results. 
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1968).  In  the  present  version,  the  subject  is  presented  with  the  description  of  a 
hypothetical  individual  (see  Table  I for  examples).  The  description  consists  of 
the  name  of  the  individual,  the  discipline  in  which  he  is  currently  majoring 
(Accounting,  Law,  or  Social  Work),  and  a character  sketch  written  around  ten 
adjectives  or  traits.  The  subject’s  task  is  to  read  the  description,  form  a coherent 
impression  of  the  individual  described,  and  then  circle  those  adjectives  on  a 
checklist  that  appropriately  describe  the  individual. 

After  performing  this  task  with  a small  number  of  sketches,  the  subject  is 
unexpectedly  tested  for  his  memory  of  the  actual  adjectives  used  in  the  descrip- 
tion of  each  hypothetical  individual.  In  the  experiment  to  be  reported,  the  test  is 
for  recognition  memory.  Using  the  same  list  of  adjectives  as  was  employed  for 
the  impression  task,  the  subject  now  indicates  which  ones  he  believes  were  in  the 
sketch  of  a given  individual  as  well  as  his  rated  confidence  in  the  judgment.1 2 

The  independent  variable  in  the  present  experiment  is  the  degree  to  which  the 
sketch  of  a hypothetical  individual  is  discrepant  from  the  sterotype  of  the 
category  to  which  he  is  assigned.  The  degree  of  discrepancy  is  determined 
operationally  by  having  a normative  sample  of  judges  rate  the  similarity  of  the 
individual  described  in  each  sketch  to  the  stereotype  for  each  of  the  majors. 


Schemata,  Prototypes,  and  Stereotype 

We  assume  that  our  subjects  possess  schemata  for  the  categories  consisting  of 
Accounting,  Law,  and  Social  Work  students.3  We  view  the  schema  foi  a given 
category  as  a system  or  set  of  criteria  by  means  of  which  the  subject  can  either 
generate  or  recognize  instances  that  are  members  of  the  category.  For  the 
impression  formation  task,  'J  e further  assume  that  the  schema  also  includes  an 
ideal  or  representative  instance  of  the  category-the  prototype- which  is  em- 
ployed as  a standard  against  which  new  instances  are  compared. 

A schema  for  the  category  of  Law  Student,  for  example,  would  consist  of  a 
prototype  of  a typical  law  student  along  with  rules  and  criteiia  th  it  specify  the 
type  and  range  of  permissible  transformations  that  can  be  peiformed  on  this 


1 The  literature  on  memory  indicates  that  the  i ffects  that  we  want  to  explore  within  this 
paradigm  will  most  likely  show  up  in  free  or  cued  recall  i-ther  than  in  recognition  memory 
(cf.  Kintsch,  1970).  Nevertheless,  we  employed  only  recognition  memory  in  our  initial 
experiment.  One  reason  for  this  choice  was  the  hope  that  we  could  obtain  many  more 
inteiesting  indices  of  how  the  discrepancy  from  stereotype  in  the  initial  descriptions 
eventually  showed  up  in  later  recognition.  In  our  subsequent  studies  we  plan  to  use  recall,  or 
a combination  of  recall  and  recognition. 

’The  concept  of  “schema”  has  been  used  in  a variety  of  ways  and  has  acquired  a variety 
of  connotations  within  the  field  of  psychology  (for  examples  see  Attneave,  1957b;  Bartlett, 

1932;  Evans,  1967;  Northway,  1940;  Oldfield,  1954,  Reed,  1973;  Woodworth,  1938).  In 
this  chapter  only  some  of  these  varied  connotations  are  intended.  We  make  no  assumptions, 
for  example,  about  how  the  schemata  originate  nor  about  the  specific  mechanisms  by  which 
they  operate. 
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prototype.  When  a prototype  for  a given  category  is  shared  by  members  of  a 
subculture,  as  are  the  prototypes  for  the  majors  used  in  this  experiment,  then  we 
call  the  prototype  a stereotype. 

Properties  of  the  Impression  Formation  Task 

The  impression  that  tlie  subject  forms  on  the  basis  of  a small  set  of  traits  seems 
to  correspond  closely  to  Bartlett’s  notion  of  a schema  (Bartlett,  1932).  Bartlett  s 
use  of  the  concept  of  “schema”  has  been  criticized  as  b>  ing  vague  and  inconsis- 
tent (North way,  1940;  Oldfield  & Zangwill,  1942).  Ono  problem  is  that  what 
Bartlett  was  pointing  to,  while  real  enough,  is  elusive  pnd  hard  to  describe. 
Rather  than  being  a clearly  delineated  cognitive  structure,  Bartlett’s  schema 
seems  to  be  a diffuse,  quasi-affective  “organized  setting”  within  which  the 
subject  tries  to  make  sense  of  material  to  be  assimilated  or  to  be  recalled. 

Apparently,  the  subjects  readily  form  impressions  on  the  basis  of  a few  items 
of  information  about  an  individual.  Because  there  is  a great  deal  of  inter-and 
intra-srbject  consistency  in  the  impressions  that  are  formed,  social  psychologists 
attribute  an  “implicit  personality  theory”  to  their  subjects.  (Hastorf  et  al., 
1970).  These  implicit  personality  theories,  in  the  realm  of  interpersonal  percep- 
tion, seem  to  play  a role  much  like  Schank’s  (1972)  system  of  conceptual 
dependencies. 

What  is  of  interest  for  the  purposes  of  the  present  research  is  th;  fact  that 
subjects  form  coherent  impressions  even  when  given  information  that  is  inter- 
nally inconsistent.4  A major  objective  of  this  research  program  is  to  study  the 
differential  effects  upon  memory  of  achieving  such  coherent  impressions  of 
information  that  vary  in  their  internal  consistency  or  compatibility.  Thus,  we 
attempt  to  experimentally  manipulate  the  degree  of  inconsistency  while  ensuring 
that  the  subject  will  be  able  to  achieve  a coherent  impression. 

The  adjectives  chosen  by  the  subject  to  indicate  his  impression  provide  two 
kinds  of  information  about  his  “inferences”  from  the  description  he  is  given. 
First,  they  indicate  the  relative  influence-upon  the  subject’s  initial  impression  of 
the  assigned  major  and  the  character  sketch.  Second,  they  provide  a baseline  for 
subsequent  memory  of  the  sketch.  We  can  evaluate  the  relative  influence  upon 
memory  of  those  adjectives  contained  in  the  sketch  and  tho„  'ised  by  the 
subject  in  his  initial  impression. 

Zangwill  (1972)  cites  some  studies  which  demonstrate  that  what  the  subject 
reproduces  in  recall  depends  heavily  upon  his  initial  response  to  the  original 

‘After  having  conducted  this  experiment,  we  have  discovered  that  subjects  do  not  always 
foim  an  integrated  impression.  In  the  experiment  by  Gollin  (1954)  less  than  one-fourth  of 
the  subjects  attempted  to  integrate  or  find  a coherent  basis  for  apparently  discrepant 
information  about  a given  person.  The  remaining  subjects  focussed  on  one  aspect  of  the 
information  while  ignoring  the  discrepant  aspect  or  included  both  aspects  without  attempt- 
ing to  reconcile  them. 
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stimulus.  Northway  (1940)  similarly  argues  that  what  the  subject  reproduces  in 
serial  reproduction  tasks  is  not  the  original  stimulus  but  his  initial  perception  of 
that  stimulus.  And  the  Russian  psychologist,  Smirnov  (1973)  concludes  that  the 
perception  of  the  original  material  proceeds  at  two  levels.  One  is  the  level  of 
actual  details  and  the  other  is  the  level  of  a more  abstract  impression  which  we 
achieve  in  our  effort  to  comprehend  the  material.  This  notion  is  much  like  that 
of  the  schema-with-correction  theme  as  employed  by  Attneave  (1957b)  and 
Posner  and  Keele  (1968,  1970).  The  important  point  is  that  memory  may  be 
guided  as  much,  if  not  more,  by  the  general  impression  formed  at  time  of  initial 
exposure  as  by  actual  content  of  the  stimulus. 


METHOD 


Stimulus  Materials 

The  stimulus  materials  consisted  of  three  personality  sketches,  each  attributed  to 
a hypothetical  individual.  Each  sketch  was  written  around  a set  of  ten  adjectives 
selected  on  the  basis  of  a normative  study  in  which  12  judges  were  presented 
with  a list  of  1 1 occupational  majors  such  as  Physical  Education,  Accounting, 
Law,  Medicine,  Etc.  The  judges  were  instructed  to  imagine  the  typical  graduate 
student  who  would  be  majoring  in  each  of  these  areas.  For  each  such  typical 
major,  the  judges  went  througi  Anderson’s  list  of  555  adjectives  (Anderson, 
1968)  and  circled  each  one  that  they  felt  to  be  descriptive  of  that  individual.  On 
the  basis  of  the  agreement  among  the  judges,  six  nonoverlapping  sets  of  10 
adjectives  were  selected  for  the  construction  of  six  separate  sketches.  Each 
sketch  was  written  to  fit  the  stereotype  of  a particular  occupational  major.  An 
attempt  was  made  to  employ  adjectives  that  were  unique  to  a single  major,  that 
were  not  strongly  negative  on  rated  likeableness,  anj  for  which  a reasonable 
number  of  equivalent  synonyms  existed  on  the  list  The  three  sketches  used  in 
the  present  study,  each  paired  with  the  occupation  il  major  to  which  it  is  most 
appropriate,  are  listed  in  Table  1. 

The  perceived  similarity  of  each  sketch  to  each  major  was  measured  in  a 
second  normative  study  employing  53  judges,  all  of  whom  were  drawn  from  the 
same  population  as  the  subjects  in  the  experiment.  The  judges  formed  an 
impression  of  the  individual  described  in  each  of  the  six  sketches.  Immediately 
after  reading  each  sketch,  each  judge  described  his  impression  of  the  target 
person  by  checking  the  appropriate  adjectives  on  a 200-word  checklist.  This 
200-word  checklist  had  been  constructed  from  Anderson’s  list  of  555  by 
eliminating  those  adjectives  that  were  never  or  rarely  used  in  the  first  normative 
study.  A set  1,  defined  as  the  normative  impression  for  a sketch,  was  derived  for 
each  sketch  from  these  initial  descriptions.  Each  adjective  that  was  circled  by 
60%  or  more  of  the  judges  was  included  in  the  impression  set  I for  that  sketch. 
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I 

I 

I 


TABLE  1 

The  Stimulus  Materials:  The  Three  Personality 
Sketches  Employed  in 
This  Experiment4 


MICHAEL  DECKER 

»“riV  ,CTnt!y  doil,g  *Iadua,e  work  in  Li*.  During  his  senior  year  in  high 
M,Chae*  D^^ws  school  counselor  wrote  the  following  description  of  him  based 
upon  interviews  and  psychological  tests: 

JET*  Person  who  often  is  impulsive  and  daring  in  his  thinking  and 

mg  skePu^’-  he  ‘ends  to  be  outspoken  in  his  opinions.  He  tackles  problemTin  an 
aggressive  manner.  He  is  both  talkative  and  forceful  in  his  social  interactions.  Although 
possessed  si  & fiery  temperament,  he  is  quite  shrewd  in  his  dealings  with  others. 

ROBERT  CAYWOOD 

in  S"  ?yr°n°dJ.4  'T'"^  d0ing  gladUate  WOfk  in  Accou»“ing-  During  his  senior  year 
m high  school,  Robert  Caywood’s  school  counselor  wrote  the  following  description  of  him 
based  upon  interviews  and  psychological  tests. 

Robert  is  basically  a cautious  and  thrifty  person.  His  outlook  is  crialistic  and  his 
classmates  probably  regard  him  as  insensitive  In  tackling  any  assignment  he  is  orderly  and 
thorough.  Because  he  is  serious  and  unassuming,  he  appears  to  be  socially  withdrawn.  Both 
politically  and  in  other  ways  he  is  a conformist. 


ANDREW  FLEMING 

CTfnUy  doin*  gradua,e  work  in  Social  Work.  During  his  senior  year 
m high  school,  Andrew  Fleming  s school  counselor  wrote  the  following  description  of  him 
based  upon  interviews  and  psychological  tests: 

Andrew  is  both  a warm  and  idealistic  person.  Basically  a trusting  individu_,  he  is  patient 
and  sympathetic  in  dealing  with  people.  His  classmates  describe  him  as  generous  and 
considerate.  He  is  genuinely  trier  ant  of  other  viewpoints  and  sincere  in  his  desire  to  listen 
Andrew  readily  agrees  that  b:,  outlook  is  sentimental 

'Each  sketch  is  paired  with  the  occupational  major  to  which  it  is  most  appropriate  in 
terms  of  judges  ratings.  The  ten  key  adjectives  are  italicized  in  each  sketch:  they  were  not 
italicized  in  the  actual  stimulus  materials. 


Next,  the  judges  directly  rated  the  similarity  of  the  sketch  to  the  typical 
graduate  student  in  each  of  10  occupational  majors.  The  ratings  were  made  on  a 
7;po'nt  *;fle  with  indicating  “very  similar”  and  “7”  indicating  “very 
Q.'ssimilar.”  Note  that  that  these  ratings  provide  us  with  a measure  for  our  basic 
independent  variable:  the  “similarity-to-stereotype”  for  each  possible  sketch- 
major  combination. 

Finally,  the  53  judges  indicated  their  impressions  of  each  of  the  10  occupa- 
tional majors  by  checking  items  on  the  200-word  checklist  described  above.  A 
set  A/,  defined  as  the  stereotype  for  a major,  was  constructed  from  those 
adjectives  endorsed  by  more  than  60%  of  the  iudges  for  a given  major 
From  these  normative  data,  the  three  sketches  and  the  three  occupational 
majors  to  be  used  in  the  present  study  were  selected.  Table  2 shows  the 
combinations  of  the  majors  and  sketches  used  with  each  experimental  condition 
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TABLE  2 

Tha  Major-Skatch  Pairings  Used 
in  Each  of  tha 
Experimental  Conditions0 


Experimental 

condition 

Sketch  Name 

CAYWOOD 

FLEMING 

DECKER 

A 

Accounting  (2.0) 

Law  (4.1) 

Social  work  (4.8) 

B 

Social  work  (6.2) 

Accounting  (5.6) 

Law  (2.2) 

C 

Law  (3.8) 

Social  work  (1.2) 

Accounting  (4.6) 

The  columns  indicate  the  sketch  name  and  the  rows  the  experimental 
condition.  Each  cell  indicates  the  assigned  major  for  the  condition.  The 
numbers  in  parentheses  are  the  average  ratings  of  slmilarity-to-stereotype  for 
each  major -sketch  pairing.  The  ratings  were  made  by  judges  on  a 7-point  scale 
such  that  “1”  indicates  “very  similar”  and  “7"  Indicates  “very  dissimilar.” 


along  with  the  sir  Hart  ty-to-stereotype  ratings  for  each  major-sketch  combina- 
tion.5 


The  Adjective  Check  List 

The  construction  and  the  composition  of  the  adjective  list,  which  is  the  constant 
instrument  upon  which  all  the  impressions  and  recognition  judgments  are 
mapped,  are  very  important.  A new  checklist  of  91  adjectives  was  constrr  ;d 
by  selecting  items  from  three  sets-the  character  sketch  itself,  C;  the  generalized 
impression  of  the  sketch,  I;  the  stereotype  of  an  occupation  major,  M— appropri- 
ate to  one  of  three  sketches  or  the  three  majors.  We  attempted  to  get  an  even 
distribution  of  adjectives  among  the  subsets  formed  by  the  setsC,  /,  and  M and 
their  complements.  The  greatest  set  overlap  is  between  the  stereotype  for  Social 
Worker  and  the  impression  for  ; Jeming  (10  of  22  adjectives);  the  least  is 
between  the  stereotype  for  Law  and  the  impressit  n for  Decker  (2  of  24 

‘Ideally,  the  arrangement  ot  sketches,  assigned  majors,  experimental  conditions,  and 
relative  similarity-to-stereotype  wouL  form  a Greco-Latin  square  with  each  of  these  fou 
factors  orthogonal  to  one  another.  We  could  not  achieve  such  complete  orthogonality  witt 
the  present  set  of  three  sketches  and  majors.  In  fact,  out  of  the  total  set  of  six  sketches  and 
majors  that  we  started  with,  we  could  form  only  one  balanced  Greco-Latin  square  with  a 
subset  of  three.  But  this  subset  had  undesirable  features  such  as  strong  overlap  between  two 
of  the  sketches  and  a very  wide  range  of  discrepancy  in  one  condition  and  practically  no 
range  in  discrepancy  for  another.  The  selection  employed  here  is  consequently  a com- 
r romise.  There  is  confounding  between  the  moderate  and  extreme  levels  of  discrepancy  in 
that  Law  is  never  paired  with  the  most  discrepant  case  in  any  of  the  three  experimental 
groups  while  Accounting  is  paired  with  the  most  discrepant  case  in  two  of  the  conditions. 
The  contrast  between  the  least  discrepant  case  and  the  other  two  levels  of  discrepancy, 
however,  is  orthogonal  to  the  other  three  factors. 
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adjectives).  The  stereotypes  for  Law  and  Accounting  share  two  adjectives  out  of 
a total  of  25  between  them.  Table  3 may  help  to  clarify  what  was  accomplftned 
in  the  two  normative  studies  that  supplied  the  sketches,  the  majors,  and  the 
checklist  for  the  present  experiment. 

The  checkiic*  was  constructed  so  that  a number  of  comparisons  could  be  made 
from  subject’s  responses.  Table  4,  for  example,  illustrates  the  partition  of  the 
checklist  relevant  to  the  analysis  of  responses  to  the  major-sketch  pairing  of 
(Fleming,  Social  Worker).  Table  4 provides  such  information  as  the  following: 
both  Andrew  Fleming  and  the  typical  major  in  Social  Work  are  considered  to  be 
kindly  and  pleasant;  Fleming  is  considered  to  be  good-humored  and  broad- 
ninded,  but  these  traits  are  not  part  of  the  stereotype  of  the  Social  Worker.  On 
the  other  hand,  the  typical  student  in  Social  Work  is  described  as  accessible  and 
dedicated,  adjectives  which  are  not  part  of  the  general  impression  of  Fleming. 
Also,  even  though  the  adjective  idealistic  is  used  in  the  sketch  of  Fleming,  the 
majority  of  judges  did  not  check  it  as  part  of  their  impression  of  him.  Later  we 
will  add  two  more  cross  paritions  of  the  check  list  for  more  detailed  ana'yses- 
one  includes  the  lir.t  of  adjectives  included  in  the  other  two  sketches  and  the 
other  includes  those  adjectives  actually  checked  by  a subject  as  descriptive  of  the 
combination  (Fleming,  Social  Worker). 

TABLE  3 

The  Normative  Studies  and  Their  Yields 


Study  1:  12  judges 

Judges  describe  each  of  11  occupational  majors  - 555-word  checklist  (Anderson, 
1968). 

Results: 

(i)  Six  nonoverlapm-ig  sets  of  10  adjectives,  each  set  appropriate  to  a different  one  of  6 
majors.  A character  sketch  is  written  around  each  set. 

(ii)  A reduced  list  oi  200  adjectives. 

Study  2:  53  judges 
Judges 

a.  Describe  the  sketches  on  200-word  checklist 

b.  Rate  each  sketch  for  similarity  to  stereotype  of  10  occupational  majors  on  scale 
from  “1"  (very  similar)  to  “7”  (very  dissimilar). 

c.  Describe  each  of  the  10  occupational  majors  on  the  200-v/ord  checklist. 

Results: 

i.  normative  impression  set  of  adjectives  for  each  sketch 

ii.  rated  similarity  of  each  sketch  to  each  major. 

iii.  stereotype  set  of  adjectives  for  each  major. 

From  these  results,  the  three  sketches  and  the  three  majors  were  selected  and  the 
^1-word  checklist  was  constructed. 
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TABLE  4 

The  Partition  of  tha  Checklist  Relevant  to 
Analyzing  tha  Results  for  tha  Pairing 
(Flaming,  Social  Work)4 


CTAf 

aw 

idealistic 

sentimental 

trusting 

generous 

ClW 

Cl M 

good-humored 

kindly 

soft-hearted 

pleasant 

trustworthy 

sensitive 

broadminded 

sociable 

cheerful 

thoughtful 

congenial 

acceptant 

coopera  the 

charitable 

emotional 

earnest 

gentle 

friendly 

helpful 

CTM 

OM 

(empty) 

patient 

sincere 

sympathetic 

tolerant 

warm 

considerate 

CTM 

erw 

accessible 

accurate 

adaptive 

analytical 

dedicated 

careful 
cautious,  etc. 

aC  stand*  for  the  set  of  adjectives  in  the  sketch  for 
Fleming;  / stands  for  the  set  of  adjectives  in  the  normative 
impression  of  this  sketch;  and  M stands  for  the  set  of  adjec- 
tives in  the  stereotype  of  Social  Work.  C,  * lif  stand  for  ihe 
comp'ements  of  these  sets. 


Subjects.  Forty -seven  subjects,  taking  undergraduate  psychology  courses,  were 
asked  to  participate  in  an  experiment  on  impression  formation  for  which  they 
would  be  paid  $3.00.  They  were  told  that  the  experimental  session  would  take 
between  if  to  2 hr. 

Procedure.  Each  subject  was  given  a booklet  with  detailed  instructions.  Each 
subject  could  go  through  the  entire  experimental  session  at  his  own  pace.  Total 
time  varied  from  30  to  90  min.  The  subject  was  told  that  he  would  be  reading 
three  brief  sketches,  each  describing  a different  person.  He  was  to  form  an 
impression  of  the  person  described  and  then  record  which  adjectives  on  the 
accompanying  lists  seemed  to  fit  his  overall  impression.  He  was  to  read  a sketch 
for  as  long  as  he  felt  necessary.  He  was  to  think  about  what  the  person  was  like 
overall.  Once  he  felt  he  had  formed  an  impression  of  the  person,  he  was  to  turn 
the  page  to  the  list  of  adjectives.  Without  looking  bark  to  the  sketch, he  was  to 
read  down  the  list  of  adjectives.  Whenever  he  encountered  a word  that  seemed  to 
fit  his  idea  of  what  the  person  was  like,  he  was  to  c rcle  it.  The  subject  did  this 
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for  each  of  the  three  sketches  in  turn.  The  instructions  were  repeated  prior  tc 
each  sketch. 

Upon  complei-  • the  checklist  for  the  third  sketcis,  subject  came  upon  ti  e 
following  instructions: 

In  the  Tint  part  we  asked  you  to  form  an  impression  of  a person  on  the  basis  of  the 
description  we  supp’ied  to  you.  At  that  time  we  did  not  tell  you  that  you  were  to 
remember  anything  about  the  description.  But  in  this  second  part  of  the  experiment,  we 
are  going  to  ask  you  to  remember  as  best  you  can  those  adjectives  that  w e actually  in 
•he  descriptions  we  supplied  to  you.  We  are  aware  that  your  ability  to  recognize  that  an 
adjective  was  or  was  not  in  the  sketch  of  a given  person  cannot  be  done  with  absolute 
certainty. 

Detailed  instructions  about  how  to  employ  the  6-point  rating  scale  followed. 
For  each  adjective,  subject  was  to  circle  “1”  if  he  was  very  confident  that  the 
word  did  appear  in  the  sketch;  “2”  if  he  was  reasonably  confident  that  the  word 
did  appear;  “3”  if  he  was  slightly  confident  that  it  did  appear;  “4”  if  he  was 
slightly  confident  that  the  word  did  not  appear,  etc.  We  did  not  allow  a neutral 
category  on  the  scale  because  our  preliminary  experiments  indicated  that  sub- 
jects tend  to  overuse  the  middle  category. 

The  subject  ther  turned  the  page  and  encountered  the  list  of  91  adjectives 
with  the  rating  scales  beside  each  one.  At  the  very  top  of  the  page  was  the  name 
and  major  of  the  major-sketch  combination  that  he  was  trying  to  remember. 
When  he  completed  his  ratings  for  one  sketch,  he  turned  the  page  to  a repetition 
of  the  instructions  in  case  he  needed  to  refresh  his  memory  for  them.  He  then 
turned  to  the  recognition  task  for  the  second  sketch;  and  then  finally  to  the  last 
sketch. 

The  experiment  was  conducted  in  a large  classroom  with  the  subjects  seated 
such  that  all  could  be  monitored  by  three  experimenters  who  were  present 
throughout  the  session. 


RESULTS 

The  results  are  based  upon  the  data  from  44  of  the  47  subjects  distributed  as 
follows;  14  in  Condition  A,  15  in  Condition  B,  and  15  in  Condition  C (Three 
subjects  failed  to  follow  instructions). 

The  Impression  Formation  Task 

Figure  1 summarizes  the  data  of  how  th<  subjects  used  the  checklist  categories 
to  describe  their  impressions  of  the  sketches.  The  categories  C (the  sketch 
adjectives),  CY  (adjectives  in  normative  impression,  cut  not  in  sketch)  and  K 
(adjectives  in  the  other  two  sketches)  and  Base  (all  the  remaining  adjectives) 
were  employed  because  they  consisted  of  identical  adjectives  for  a given  sketch 
and  all  three  category  assignments.  The  data  are  pooled  across  the  three 
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L LEAST  APPROPRIATE  CATEGORY 
I*  MOST  APPROPRIATE  CATEGORY 
N'NEXT  APPROPRIATE  CATEGORY 
*'MttCA.eS  ASSAYED  CATEGORY 


~i  s w "t"  « 7 

AVERAGE  SIMILARITY  TO  STEREOTYPE 

FIG.  2 Proportion  of  adjectives  selected  from  ihe  three  stereotype  categories  on  the 
checklist  during  ihe  impression  formation  task.  The  abscissa  has  the  same  meaning  as  in  Fig. 
I.  The  arrows  indicate  the  assigned  category. 

the  checklist.  The  top  line  on  the  graph,  for  example,  shows  the  tendency  to  use 
^d>ectives  in  the  most  appropriate  category  (the  stereotype  which  it  closest  to 
the  rJcetch)  when  that  category  is  the  assigned  one  and  ».  !»tcn  it  is  not.  Here  it 
looks  as  if  the  tendency  to  use  adjectives  from  the  appropriate  stereotype  is 
greater  when  that  is  also  the  assigned  stereotype.  Although  this  is  a reasonable 
and  expected  finding,  not  toe  much  reliance  should  be  placed  upon  it  because  it 
is  mainly  due  to  one  sketch. 

With  one  exception,  the  tendency  to  use  adjectives  from  the  various  stereotype 
categories  seems  to  be  determined  almost  entirely  by  the  similarity  of  the  sketch 
to  the  stereotype  rather  than  by  the  assignment  of  the  sketch  to  a major  The 
exception  is  for  the  intermediate  case.  In  this  latter  case  it  does  appear  that 
assigning  a sketch  to  a major  that  is  moderately  discrepant  (rather  tlian  ex- 
tremely discrepant)  does  result  in  an  enhanced  tendency  to  use  adjectives  in  this 
category  to  describe  the  impression.  Although  this  finding  of  some  assimilation 
to  the  stereotype  for  the  moderately  deviant  assignment  is  consistent  with  our 
predictions,  the  effect  is  rather  small  and  local  (in  the  sense  that  it  does  not  seem 
to  affect  other  categories  of  the  description).* 

‘Subrequenl  analyses  of  individual  adjectives  and  a replication  experiment  wiih  a much 
larger  sample  confirms  the  facl  that  the  asuignmenl  of  a major  lo  a sketch  affects  both  the 
impression  and  Ihe  recognition  task  The  analysts  by  large  categories  of  ad|eclrves  rm*t  Ihrs 
efTecI  because  many  of  the  adjectives  do  not  discriminate  because  they  art  easily  rejected  as 
being  irrelevant  to  a parlicul.tr  sketch. 
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The  Recogr  ition  Task 

The  dependent  variable  for  evaluating  the  suhier-.V  „ 

adjectives  in  the  sketches  is  simolv  tL  J recognition  memory  for  the 

of  adjectives.  ^ the  avera*e  ratb*  for  the  various  categories 

Figure  3 summarizes  the  basic  ™ .u 

preceding  two  figures  the  results  are  a-  i recognit,on  task'  As  in  the 
similarity  Urn  L ,o  ' V °f  ""  ■«"* 

and  assigned  major  has  lit  tie  effect.  ’ d,SC‘Cpancy  be,ween  sketch 

foS'by  SSS  ethTr?  'a,i"8S  i’,0k"'  d°'™  **  * ‘■•‘norms 

(baseline, ^into  n C(ik"Ch  *«“'*">•  a "*  » 

not  used  in  his  ov-n  description  (B)  Rn  " d,:scrlPtl0n  (D)  and  those 

subjects  consistently  tend  to  rate  the  adi'  ^ ° ™d  the  D cate8°ries,  the 
as  more  hkely  to  have  been  sket^Z  ib  T ^ in  0,6  Sketch 

normative  associates  (Cf)  In  Fig  3 this  effe  » i heydo  the  adJectives  in  the 
subjects’  descriptions  than  fo ZTf™  “ larger  f°r  ,hose  ^ives  in  the 

^ s 


represents  the  rating  ^kZc^rangeT ^^'7”  SI«=reo.ype.  The  ordinate 

sketch,  through  “6,”  very  confident  that  adtective’i*  * C°r  'dent  ’hal  adiec,Ive  was  m th' 
on  the  data  from  44  subjects.  aS  not  1,1  l^e  s*cet^h.  Each  point  is  based 
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the  same  sire  for  both  the  adjectives  used  by  the  subjects  in  their  descriptions  of 
the  sketches  and  for  those  not  used  in  their  descriptions.  Finally,  for  each  of  the 
three  categories,  the  adjectives  actually  used  in  the  subjects’  descriptions  of  their 
sketch  impressions  tend  to  be  consistently  rated  as  more  likely  to  have  been  in 
the  sketch  than  those  not  used  by  the  subjects  in  their  descriptions. 

Figure  3 suggest.,,  then,  that  the  recognition  ratings  can  be  explained  or 
accounted  for  by  three  approximately  additive  main  effects.  One  effect  is  due  to 
the  adjective  actually  having  been  in  the  sketch;  another  effect  is  due  to  the 
adjective  being  a member  of  the  impression  of  that  sketch  generated  by  a group 
of  judges;  the  third  effect  is  due  to  the  adjective  being  one  of  those  actually  used 
by  the  subject  in  his  description  of  the  immediate  impression  he  formed  of  the 
sketch-major  combination.  Our  analysis  of  variances  on  the  three  sketches 
confirms  this  impression.  The  reader  should  compare  Fig.  3 with  Fig.  1 to  note 
how  closely  the  recognition  data  mirror  the  descriptions  made  during  the 
impression  formation  task. 


DISCUSSION 

This  chapter  examines  one  of  a contemplated  series  of  studies  that  will  explore 
the  usefulness  of  the  impression-formation  task  as  a paradigm  for  investigating 
issues  related  to  the  restructuring  of  memory  and  the  acquisition  of  knowledge. 
Two  issues  of  concern  in  the  present  experiment  are  the  role  of  the  subject’s 
immediate  impression  upon  his  subsequent  memory  for  the  sketch  and  the  effect 
of  assigning  the  sketch  to  compatible  or  discrepant  majors.  The  data  suggest  that 
neither  the  subject’s  initial  impression  nor  the  major  to  which  the  hypothetical 
individual  was  assigned  have  any  appreciable  aff  :ct  upon  his  recognition  memory 
for  which  adjectives  actually  were  included  in  the  sketch.  Instead,  two  approxi- 
mately additive  components  appear  to  determine  ti.e  subject’s  tendency  to  judge 
an  adjective  as  having  been  in  the  sketch.  One  component  is  v hether  or  not  the 
adjective  actually  did  occur  in  the  sketch.  And  the  other  component  is  whether 
the  adjective  belongs  to  the  set  of  normative  adjectives  that  judges  have  checked 
as  characteristic  of  the  individual  described  in  the  sketch. 

The  analysis  of  variance  indicate  that  the  subject’s  confidence  ratings  can  be 
accounted  for  by  three  additive  main  effects.  One  main  effect  results  from  the 
adjective’s  having  been  in  the  sketch.  The  second  results  from  the  adjective  being 
a member  of  the  normative  inpression  set  generated  by  the  sketch.  The  tnird 
main  effect  is  due  to  whether  the  adjective  was  or  was  not  in  the  subject’s  own 
immediate  description  of  the  mpression  created  by  the  sketch-major  combina- 
tion 

The  lack  of  any  interaction  between  the  checklist  categories  and  the  usage  of 
adjectives  in  the  impression  task  along  with  the  parallel  outcomes  from  both  the 
impression  and  recognition  tasks  suggests  that  we  have  two  dependent  variables 
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which  both  reflect  the  same  effects.  The  fact  that  one  dependent  variable  is 
obtained  prior  to  the  other  apparently  does  not  have  any  effect  on  the  outcome. 

The  use  of  “determine"  in  the  preceding  paragraphs  was  not  intended  to 
indicate  causality.  While  it  makes  sense  to  view  the  occurrence  of  an  adjective  in 
the  stimulus  sketch  as  “causing"  an  increment  in  recognition  memory  for  that 
adjective,  it  makes  much  less  sense  to  treat  the  normative  category  of  adjectives 
in  the  skftm''  impression  as  a causal  agent.  Presumably  the  adjectives  in  the 
sketch  cause  suoject  to  generate  or  retrieve  a category  or  schema  for 
describing  the  indiv.duil  in  the  sketch.  This  personal  category  or  impression  then 
becomes  the  sourc  or  causal  agent  for  increasing  the  subject’s  tendency  to 
falsely  recognize  an  adjective  as  having  been  in  the  sketch.  This  personal 
impression  is  partially  correlated  with  both  the  normative  impression  of  the 
sketch  and  with  the  subject’s  choice  of  adjectives  in  the  impression  task. 

It  is  possible  that  the  effect  of  the  subject’s  immediate  impression  of  the 
stimulus  might  show  up  in  recognition  memory  with  a longer  interval  between 
stimulus  presentation  and  subsequent  testing  for  retention.  As  mentioned  in  the 
introduction,  it  is  even  more  likely  that  such  an  effect  will  show  up  in  recall 
rather  than  recognition.  The  da*a  on  memory  for  visual  form  (Riley,  1962)  and 
vario’is  attempts  to  replicate  and  elaborate  Bartlett’s  work  (Zangwill,  1972) 
indicate  that  reproduction  and  recall  are  heavily  influenced  by  the  subject’s 
initial  response  to  a stimulus  but  that  recognition  memory  may  or  may  not  be  a 
function  of  this  initial  reaction.7 

The  most  important  result  is  the  fact  that  recognition  memory  seems  to  be 
strongly  influenced  by  two  approximately  additive  components.  Possibly  there  is 
also  a slight  influence  of  the  stereotype  of  the  assigned  major.  Future  studies  will 
investigate  how  these  components  fare  over  longer  intervals  of  retention  and  in 
terms  of  other  indices  of  retention.  From  the  work  on  pattern  recognition  (Hyman 
A Frost,  1974  Posner  A Kecle,  1968,  1970)  and  from  some  of  the  wor1,  or 
semantic  memory  (Barclay.  1973;  Bransford,  Barcley,  A Franks,  1972;  Brans- 
ford  A Franks,  1971);  we  would  expect  that  with  increasing  retention  intervals 
or  with  heavier  memory  loads  the  memory  of  the  specific  adjectives  or  details  of 
the  sketch  would  fade  to  zero  or  to  a negligible  level  while  the  tendency  to  base 
the  memory  upon  the  general  impression  will  remain  stable.  The  so-called 
“reconstructive"  aspects  of  the  memory  will  then  dominate  subsequent  reten- 
tion. Whether  this  latter  result  is  seen  as  a defect  or  a virtue  of  memory  will 
depend  upon  the  task  and  the  goals  set  by  the  experimenter,  the  educator,  or  the 
student.  It  is  interesting  to  note  that  in  the  pattern-recognition  studies  it  is 
considered  a virtue  to  respoH  in  terms  of  a generalized  impression  or  abstracted 


’Since  ihe  present  experiment  was  reported  we  have  completed  a new  study  in  which  we 
used  both  recall  and  recognition  measures.  The  recall  scores  show  the  same  pattern  of  results 
as  do  the  recognition. 
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image,  whereas  in  recognition-memory  performance  is  scored  in  terms  of  ability 
to  discriminate  the  actual  details  from  the  general  impression. 

Oui  manipulation  of  discrepancy  from  stereotype  represents  only  one  of  many 
possible  ways  to  lanipulate  discrepancy.  Our  majors  “Law  Student,”  “Account- 
ing Student,”  and  “Social-Work  Student”  are  by  no  means  mutually  exclusive. 
The  set  of  adjectives  in  the  stereotype  for  an  Accounting  Major  include  descrip- 
tors such  as  “analytical,”  “careful,”  “consistent,”  “methodical  ” “systematic,” 
etc.  Almost  all  of  these  descriptors  seem  to  deal  with  work  habits  and  ways  of 
coping  with  problems.  Not  a single  adjective  in  this  stereotype  list  for  Accoun- 
tant refers  to  interpersonal  relati  nships. 

The  adjectives  in  the  stereotype  for  Social  Work  major,  on  the  other  hand, 
include  such  descriptors  as  “accessible,”  “adaptive,”  “acceptant  ” “charitable  ” 
“friendly,”  “helpful,”  “kindly,”  “sensitive,"  etc.  With  practically  no  exceptions 
these  traits  all  deal  with  interpersonal  relationships.  Thus,  there  is  no  incom- 
patibility between  the  stereotypes  of  the  major  in  Accounting  and  the  major  in 
Social  Work.  In  thinking  of  an  Accountant  major,  one  does  not  typically 
consider  his  interpersonal  relationships  and  vice  versa.  But  there  is  no  reason 
why  an  individual  cannot  simultaneously  fit  the  stereotypes  of  a student  in 
Accounting  and  a student  in  Social  Work.  One  way  to  create  true  discrepancy  is 
to  work  with  mutually  exclusive  categories  or  to  deliberately  construct  sketches 
which  contain  adjectives  that  are  antonyms  of  those  adjectives  that  describe  a 
given  stereotype. 

Another  related  problem  with  the  categories  we  have  used  is  that  they  are 
quite  broad  in  their  inclusiveness.  Our  hypotheses  about  relative  disci epancy 
from  stereotype  were  based  on  the  idea  that  the  category  of  each  of  the  majors 
was  bounded  in  the  sense  that  descriptions  of  individuals  who  were  quite 
different  from  the  stereotype  would  be  viewed  by  the  subjects  as  definitely  not  a 
member  of  that  category.  It  was  hoped  that  the  sketches  that  were  intermedia  e 
in  similarity  to  the  stereotype  would  be  seen  as  quite  different  from  the 
stereotype  but  still  within  the  permissible  bounds  of  variation  from  it.  Actually, 
our  categories  seem  to  be  relatively  unbounded.  Although  the  subjects  do  seem 
to  have  coherent  and  shared  ideas  of  what  the  typical  or  prototypical  student  in 
each  occupational  major  is  like,  they  also  probably  see  these  categories  as 
relatively  unbounded  in  that  any  sort  of  student  can  choose  to  major  in  any  of 
these  areas.  Po'aibly  we  would  have  more  clearly  bounded  categories  if  we 
assigned  sketches  to  the  actual  occupations  rather  than  to  students  who  are 
training  for  that  occupation. 

Still  another  ieason  why  the  assigned  major  had  little  effect  upon  the  impres- 
sion and  the  recognition  tasks  might  be  the  fact  that  the  sketches  were  wntten 
so  as  to  be  internally  coherent  and  cohesive.  The  sketches  generated  a consistent 
impression  of  an  individual  without  any  help  from  the  assigned  major.  It  is 
possible  that  the  assignment  will  have  a noticeable  effect  only  when  the  stimulus 
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is  vague,  ambiguous,  or  internally  contradictory  (cf.  Bransford  & Johnson,  1973 
for  related  evidence).  In  the  studies  of  labeling  and  memory  for  visual  form 
(Riley,  1962)  the  stimulus  objects  are  typically  ambiguous. 

A major  purpose  of  this  chapter  is  to  indicate  the  promise  of  the  impression- 
formation  task  as  a paradigm  for  the  experimental  investigation  of  a variety  of 
questions  related  to  the  restructuring  of  memoiy,  the  acquisition  of  new  infor- 
mation, the  assimilation  of  new  inputs  to  existing  representations,  and  the 
accommodation  of  existing  representations  to  fit  the  new  inputs.  From  our 
preliminary  experiments  and  from  our  experience  with  the  experiment  discussed 
here,  we  are  encouraged  by  many  features  of  this  paradigm.  For  one  thing,  the 
subjects  say  that  they  find  the  task  meaningful  and  quite  relevant  to  their  daily 
activities.  For  another  thing,  individual  behavior  is  surprisingly  consistent  and 
well  predicted  from  population  norms  collected  on  moderately  sized  samples. 

Using  the  same  stimulus  materials  and  paradigm,  we  plan  to  study  the  effects 
on  other  ind'ces  of  memory  such  as  recall  and  the  effects  of  longer  re:ention 
intervals.  Other  extensions  of  this  work  will  involve  changing  the  manner  of 
manipulating  disciepancy  from  prototype  as  well  as  changing  the  stimulus 
materials.  For  example,  we  plan  to  make  sketches  internally  ambiguous  in  the 
sense  that  the  iu,ectives  used  will  be  drawn  from  two  or  more  different 
stereotypes.  The  assignment  of  the  sketch  to  one  or  the  other  of  potentially 
relevant  categories  should  result  in  relatively  more  emphasis  on  the  relevant 
subset  of  adjectives  both  in  forming  the  impression  and  in  subsequent  memory. 
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.h.,  whk,,  , p,.»,  , complex  xubject  me,!,,.  B, 

complex,  w.  mean  somellnng  hai  lake,  a con«de„ble  amoun,  of  lime  lo 
cam  time  measured  in  weeks  ;o>d  months,  not  in  hours  or  days.  Complex 
opics  contain  a large  structure  of  information  composed  of  the  relevant  con- 
cepts and  processes  that  make  up  the  top.c.  A large  amount  of  tune  is  necesary 
just  to  incorporate  such  a mass  of  material  into  a person's  memory  structure 
Moreover,  she.r  rote  acquisition  of  the  concepts  is  not  enough.  The  matenal 
must  be  structired  in  such  a way  that  relevant  concepts  can  be  related  properly 
one  another.  The  procedures  must  be  learned  well  enough  that  they  canbe 
per  ormed  when  needed,  and  more  important,  so  that  they  can  be  performed 
when  the  situation  is  not  quite  the  same  as  when  the  concept  was  learned. 

Before  we  can  make  much  progress  towards  the  understanding  of  how  com- 
plex topics  are  learned,  we  need  to  know  about  the  organization  of  knowledge 
witlun  human  memory  »Ve  contend  that  our  relative  lack  of  knowledge  abt  at 
the  learning  process  is  a direct  reflection  of  our  relative  lack  of  knowledge  about 
the  structure  of  human  memory.  Things  are  changmg,  however.  Psychology  is 
now  coming  to  understand  memory  structure  better.  In  lum,  we  are  now 
starting  to  get  a better  understanding  of  the  process  of  learning. 

In  this  chapter  we  examine  some  current  ideas  about  memory  structure  and 
show  their  relevance  toward  the  study  of  learning.  We  discuss  the  organization  of 
asic  memory  units  around  frames  or  schemata  and  more  especially  the  way  by 
which  a person  comes  to  modify  these  units.  The  overall  result  of  this  endeavor, 
o us  at  least,  is  both  exciting  and  disappointing.  It  is  exciting  because  it  appears 
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to  show  how  recent  work  in  the  study  of  cognitive  systems  can  potentially  be  of 
value  to  our  deeper  understanding  of  the  problems  involved  in  education.  It  is 
disappointing,  because  when  we  are  done,  it  can  be  said  that  all  that  has  been 
a-  complished  is  the  statement  of  old  knowledge  and  understanding  into  a new 
terminology.  Despite  ail  this,  we  are  optimistic  that  the  new  temiiric.’ogy, 
coupled  with  our  understanding  of  cognitive  T'octssrs  can  eventually  lead  to  a 
deeper  understanding  of  learning. 


MEMORY  REPRESENTATION 


Semantic  Networks 

A number  of  workers  in  the  field  of  human  nemory  have  recently  been 
developing  forma!  representations  for  the  knowledge  within  memory.  Most  of 
the  models  that  we  are  interested  in  here  are  described  as  semantic  newurks. 
Following  the  work  of  Quillian  (1968),  they  are  all  characterized  by  a directed, 
labeled  graph  structure,  such  as  those  shown  if  Figs.  I.  and  3.  We  call  our 
version  of  a semantic  network  an  active  structural  netw'ork  n emphasize  that  the 
representation  is  both  active  and  passive.  It  can  contain  general  procedures  that 
can  be  executed  whenever  functional  knowledge  must  be  u.ed  (This  work  is 
reported  in  Norman.  Rumelhart,  and  the  LNR1  research  group.  '975  from  now- 
on  we  refer  to  this  work  as  “LNR”).  Although  we  will  base  oui  discussion  on 
this  work  the  comments  will  apply  to  all  semantic  network  representations. 

Basically,  the  semantic  network  provides  a means  of  representin;  knowledge 
It  is  a new  tool  in  psychology.  Previously,  our  formal  models  have  been  abstract 
a mathematical  learning  model,  for  example,  talks  of  the  probability  or  the 
strength  of  some  association  between  two  elements,  but  the  elements  zte  usually 
part  of  a large  homogeneous  set.  With  semantic  networks,  we  took  at  the 
structure  of  very  particular  items. 


The  Internal  Components  ot  Verbs 

Consider,  for  example,  how  a child  might  come  to  learn  language  In  the  work  of 
the  LNR  research  group,  the  meanings  of  verbs  can  be  decomposed  into 
underlying  primitive  elements.  Thus,  some  verbs  specify  only  STATIVF  com- 
ponents; others  are  more  complex,  specifying  CAUSF  and  CHANGF  When  we 
say  that. 

( 1 ) The  skier  went  to  the  top  to  the  mountain, 
there  is  a change  of  location  of  the  skier  as  shown  in  Fig.  I 


The  actonyrnn  LNR  represents  the  reieaich  proup  at  the  l nrveruty  of  ( jhfotnu.  San 
thepo  which  hat  studied  these  issues  The  |troup  was  otkiiuHv  formed  and  tupemted  by 
Peter  Lindsay.  Donald  Norman,  and  David  Rumethan  hence.  LNR 
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FIG.  1 The  ikier  «ent  to  the  top  of  *Jk  mountain 

The  figure  shows  the  decomposition  of  the  verb  “went"  into  it?  more  basic 
underlying  components,  ijt..  its  network  structure  Network  structures  are 
composed  of  nodes  and  ordered,  labeled  relations  connecting  these  nodes.  In 
Fig.  I,  the  ovals  represent  modes  that  are  token  instances  of  propositional 
structures,  the  angular  brackets  (e.g..  skier)  represent  nodes  that  art  token 
instances  of  concepts,  and  the  phrases  enclosed  in  quotation  marks  fe.g.,  “bottom 
of  mountain")  represent  network  structures  for  the  concepts  described  by  the 
phrases,  but  which  we  have  not  shown  in  detail  in  the  figure  in  order  to  maintain 
the  clarity  of  the  diagram.  In  the  LNR  active  structural  network,  there  are  four 
different  types  of  node  structures,  two  of  which  are  shown  in  Fig.  I.  The 
numbers  of  the  token  propositional  nodes  have  no  meaning  they  are  used  solely 
to  facilitate  the  discussion  of  these  nodes. 

Figure  I can  be  interpreted  in  a straightforward  manner  by  starting  with  node 
•I.  the  oval  labeled  CHANGF  This  indicated  that  some  change  of  states  has 
taken  place  the  relations  leaving  node  *1  describe  the  states  that  are  involved. 
The  CHANGE  takes  place  from  a state  shown  by  the  node  This  nede  says 
that  tts  subject  (a  skier)  was  located  at  ihe  "bottom  of  mountam"  from  some 
unknown  time  to  some  time  not  specified  (but  indicated  by  the  unnamed  node 
shown  as  angular  brackets)  The  result  of  the  change  ts  the  state  represented  b> 
rr  Je  *3  the  skier  is  now  as  location  “top  of  mountain."  Notice  that  the  time  lie 
was  located  at  5h  top  of  the  mowitatn  it  not  specified,  except  to  indicate  that  it 
is  later  than  the  lx  x at  which  he  was  no  longer  at  the  bottom  of  the  mountain 

A mote  complex  verb  is  one  that  implies  causality  sav  "to  gr*e  " If  we  say 

(2)  Surette  took  the  skis  from  Henry 

we  mean  that  Su/ette  did  somethmg  that  caused  the  skn  to  change  from 
Henry’s  possession  to  her  possession  We  illustrate  this  sentencx  with  the  struc- 
ture shown  in  Fig  2. 
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FIG.  2 Siuelle  look  the  ikit  fiom  Henry. 


Noie  that  these  Hructure,  provide  framework,  for  fluent  knowledge 
Each  structure  u a memory  schema  it  allow,  u,  to  organiz.  the  material  that  we 
leim  U(er  on.  Thus,  if  we  learn  that 


(2B)  She  promised  to  return  them  by  morning. 

we  know  that  there  is  an  obligation  to  return  the  *kit.  From  that  we  can  deduce 
that  ihe  got  them  from  Henry  with  hi,  permmron,  and  in  fact,  we  expand  the 
framework  for  the  knowledge  to  «ome thing  like  that  shown  in  Fig  3.  Note  that 
»e  add  the  new  knowledge  directly  to  the  framework  for  the  old.  Nor  only  did 
we  have  a convenient  way  of  modifying  the  structure  for  the  previous  episode 
according  to  the  framework  provided  by  the  schema  for  the  first  sentence,  but 
we  have  now  modified  the  structure  into  one  that  is  equivalent  to  “borrow  - 
^tte  borrowed  the  *is  J,  i,  easy  to  tee  how  other  statement,  could  have 
modified  the  structure  to  indicate  that 


oureue  stole  the  Hu,  (no  permiirion  wa,  granted) 

Surette  purchased  or  rented  the  Ait  (die  paid  money  for  them) 

g"StLSTlS>  *“*  ^ llkm8  PC,M  IO  pkk  lhCTn  fot  hfr  («P«ding 


Developmental  Studies 

The  schemata  provided  by  this  form  of  structural  analyst  turn  cut  to  be  rather 
powerful  Dedre  Centner  (1 975)  shows  how  a r amber  of  the  verbs  of  possession 
can  be  analysed  m this  way.  More  important  for  present  purposes,  she  reports  on 
experiment,  wtuch  show  that  a person  , memory  of  the  actual  verb  used  in  a 
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FIG.  3 In  'had'd  area'  Sureite  HK.k  the  f,on,  Ferny.  Sot  'had'd  She  promised  lo 
relum  Ihem  by  tomorrow  Total 'nurture  Suzetle  bor  owed  the  skis  firm  Henry  promising 
lo  return  them  the  next  day 

icntena  will  vary  systematically  with  the  other  information  provided,  much  in 
the  manner  of  the  illustrations  we  have  already  provided. 

Centner  has  also  studied  the  order  in  which  these  structures  are  learned.  If  we 
look  at  Fig.  4 we  *e  the  underlying  components  that  she  has  analyzed  for  the 
structures  of  the  verbs  “give,  take,  buy,  sell,  spend  money,  trade."  Centner 
shows  how  one  can  derive  an  ordering  for  the  acquisition  of  the  underlying 
components,  and  therefore  for  the  verbs  themselves.  In  fact,  she  has  performed 
the  necessary  experiments  She  seated  children  in  front  of  tables  which  had  two 
doUs  (Bert  and  Ernie)  and  she  asked  the  children  to  make  one  doll  buy  sell  give 
and  take  toys  from  the  other  The  detrls  are  reported  in  her  paper,  but  the 
important  aspect  is  that  the  developmental  sequence  of  acquisition  followed 
th'  expectations  rather  me  ly  . See  Fig  5.  You  might  note  that  the  structures  of 
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FIG.  * The  rrliihKuhrpt  urvxi*  the  concept*  urxScr^ir*  the  »rrb»  pvt  ~ ~ukt.~  >i> 
**H.  and  ".pend  mone)  " The  verruntK  component*  m*d  tSr  iuk«  that  prrrai!  la* 
understanding  of  l.*e  mbi  in  reprevented  by  the  th >M  o**U  Age  of  tin  ^ rv 

*rib»  proceed*  verticalt),  >ounge»t  age  at  the  to?  ||  roan  (.enfart  » 


these  verbs  arc  reavxubly  complex.  The  time  course  of  the  acq  uisition  of  the 
verbs  follows  the  theoretical  ordering  of  their  comptexitv  It  takes  hxklrer 
approximately  5 years  to  process  from  the  state  m which  they  use  words  bke 
pve  and  tak*  prop  rly  to  the  point  where  they  u*e  the  entire  set  o4 
possession  verbs  shown  >n  Fig  4 property  The  structural  network  detcTiptsccn 
for  these  verbs  show  why  this  long  time  period  «*<ght  be  necrsurv  (lwtarr 
must  team  about  a number  of  different  concepts  some  dealmg  with  social 
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Functional  schemata  will  of.en  contain  a conditional  statement  and  sets  of 
events  that  ate  to  be  performed  according  to  the  tutus  of  the  conditional 
tute 

The  S»««ct  o«'  of  Sctwmjti 

S^herruu  are  selected  for  use  accord*)  j to  then  wnlants  to  the  utiaation 
The  Uodticst^n  of  Schrmiu 


•»  irtsu^  the  mlnctaom  o i upMMM  ol  a i,hw  it  m aim*  p f>lr  lo 
mb  HIM  mw  MUtsa  •«  pw  two  ruwftn  of  tea  User  A libra  i Mast 
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The  mayonnaise  protocols  illustrate  that  previously  developed  schemata  can  be 
applied  to  new  problems,  not  necessarily  in  appropriate  fashion,  of  course.  The 
determinants  of  which  schemata  get  applied  are  the  features  which  come  from 
what  is  known  in  this  case,  the  features  of  the  end  result.  The  problem  is  that 
the  relevant  properties  of  egg  and  oil  and  their  interactions  (that  they  can  form  a 
white  creamy  substance)  are  not  part  of  any  stored  schema,  so  the  correct 
answer  cannot  be  derived.  As  the  examples  illustrate,  mayonnaise  has  properties 
which  make  it  look  more  like  certain  dairy  products  than  the  result  of  mixing 
r he  yolks  of  eggs  with  oil.2 

The  point  of  this  exercise  is  simple  (perhaps  too  simple  for  the  space  it  has 
occupied).  We  believe  that  this  use  of  old  schemata  thru  analogy  is  all  pervasive 
and  powerful.  Although  the  sauces  derived  by  our  two  subjects  are  not  at  all 
mayonnaise,  they  are  intelligent  creations,  and  they  are  not  bad  sauces  for  some 
purposes.  Normally,  this  creative  use  of  an  old  schema  for  a novel  purpose  is  an 
essential  use  of  the  creative  process  of  discovery. 

Later  we  return  to  this  issue  We  believe  that  the  mayonnaise  problem  is  an 
example  of  “functional  reasoning”.  Here  is  what  we  believe  is  involved 

1 No  knowledge  of  the  components  of  mayonnaise  existed. 

2.  It  is  known  that  mayonnaise  is  smooth,  creamy,  off-white  in  color.  Its  taste 
is  known  (and  it  is  somew  hat  acidic). 

3.  General  schema 

Blends  of  foods  blend  their  properties. 

Texture,  sour  cream  or  whipped  cream  yields  a color  that  is  too  white 
and  a taste  that  is  not  right. 

Correction  add  yellow-spicy  acidic  mustard. 

So  we  have 

Selection  of  schema  by  analogy 
Modification  towards  goal 

This  is  reasoning  by  analogy  In  functional  reasoning  we  claim  there  must  be  one 
more  step  the  schema  has  variables  in  it,  and  any  concepts  that  fulfill  appro- 
priate range  restrictions  on  those  variables  may  be  used  in  the  solution  of  a 
problem. 

Example  The  white  sauce  problem.  One  important  component  of  the 
process  of  learning  and  teaching  is  that  of  communication.  The  teacher  has  the 


‘To  mike  mayonnaise,  one  puts  2 eg*  yolks  in  a mixing  bowl  with  one  teaspoon  ot 
vinegar  or  lemon  juice  Seasonings  may  be  added  (dry  muslard,  salt,  white  peppeil  Then. 
• hike  beating  vigorously  with  a wire  whip,  oil  is  added,  diop  by  drop,  until  the  mixture  is 
thick  and  creamy  (about  '/,  cup  of  oil).  More  oil  is  then  added  in  a slow  steady  stream  (all 
the  •htle  beating  rigorously)  When  l-t V,  cups  of  oil  have  been  added,  mix  in  a second 
teaspoon  of  vinegar  or  lemon  juice. 
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task  of  conveying  a particular  knowledge  structure  to  the  student.  The  learner 
has  the  task  of  deducing  just  what  structure  is  intended  by  the  teacher,  as  well  as 
the  additional  task  of  adding  the  new  information  to  his  previous  knowledge  in 
such  a way  that  it  can  be  referred  to  and  used  at  a later  time.  Many  of  the 
problems  of  learning  and  teaching  can  be  understood  as  problems  in  this 
communication  process.  Learning,  however,  is  unlike  most  simple  communica- 
tions in  that  the  structures  to  be  acquired  can  be  complex,  and  it  is  not  always 
clear  just  how  they  are  to  fit  together.  Moreover,  the  differences  in  the  knowl- 
edge shared  among  the  participants  in  a learning  situation  are  often  considerably 
greater  than  in  simple  discourse. 

In  many  ways,  this  aspect  of  the  learning  process  is  actually  a problem.  First, 
there  is  the  problem  of  identifying  the  appropriate  referent -of  determining  just 
what  topic  is  under  consideration.  Second,  there  is  the  problem  of  so  specifying 
the  information  being  acquired  and  incorporating  it  within  the  memory  struc- 
ture that  it  can  be  recovered  when  later  it  is  needed. 

Social  conventions  govern  the  form  of  the  interactions  between  a teacher  and  a 
student,  with  certain  well-formed  conventions  about  the  nature  of  the  inter- 
actions and  questions  that  normally  take  ' i.  A private  tutorial  allows  for 
more  interaction  than  does  a seminar.  In  turn,  lectures  allow  few  opportunities 
for  the  communicative  aspect  of  the  learning  process  to  take  place.3 

One  aspect  of  this  communicatory  process  is  that  the  same  teacher  may  use 
quite  different  procedures  to  teach  the  same  material  to  two  different  students. 
In  our  studies  of  tie  tutorial  didogues  between  beginning  students  and  tutors 
and  advanced  students  and  tutors,  the  difference  is  expository  style  is  clear:  the 
tutor  tends  to  lecture  the  beginner;  with  an  advanced  student,  things  are  more 
like  a relaxed  conversational  interchange  of  information.  Books  reflect  this 
difference,  for  the  style  of  the  book  depends  upon  the  level  of  the  student  which 
if  is  addressing.  One  example  readily  available  comes  from  a comparison  of  the 
recipes  in  an  elementary  cookbook  with  the  recipes  of  an  advanced  cookbook. 
Thus,  one  recipe  for  mayonnaise  in  a book  intended  for  advanced  cooks  takes 
exactly  six  sentences  (67  words)  and  a list  of  ingredients.  Essentially  the  very 
same  recipe  for  a beginning  cook  is  around  13  times  longer;  three  pages  of  text 
or  about  900  words  3nd  a list  of  ingredients.4 


5 A textbook  allows  no  interaction,  of  course,  although  in  some  sense  the  use  of  branching, 
teaching  machine  texts  or  simple  computer  assisted  instructions  sequences  do  recover  some 
aspects  of  the  communication.  The  recent  work  by  Carbonell  (1970,  1971)  and  Collins, 
Warnock,  Aiello,  and  Miller  (1975)  and  Brown,  Burton,  and  Zdybel  (1973)  adds  communica- 
tion to  computer-aided  instruction  by  providing  the  computer  system  with  a semantic  network 
of  knowledge  about  the  topic  matter  and  allowing  it  to  interact  in  a tutorial  fashion  with  the 
student. 

‘The  advanced  example  comes  from  “Masterpieces  of  French  Cuisine”  by  Amunatequi 
(1971),  and  the  introductory  example  from  "Mastering  the  Art  of  French  Cooking”  by  Child, 
Bertholle,  and  Beck  (1961). 
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One  technique  we  have  used  to  explore  the  processes  of  learning  and  teaching 
is  that  of  tutorial  instruction.  For  these  tutorial  studies  we  picked  the  topic  of 
French  cooking-to  be  exact,  the  subset  of  techniques  involved  in  the  making  of 
the  family  of  white  sauces.  This  topic  is  well  suited  for  our  purposes:  complex 
enough  that  it  constitutes  a challenge  for  the  learner,  sufficiently  self  contained 
that  one  can  hope  to  learn  a considerable  amount  within  a reasonable  amount  of 
time.  Moreover,  it  is  easy  to  find  well  motivated  students  and  teachers.  (We 
concentrated  the  sessions  on  the  coverage  provided  by  Child,  Bertholle,  & Beck, 
1961,  pp.  54-93.) 

Suppose  you  set  out  to  teach  someone  the  family  of  French  white  sauces.  How 
should  you  present  it?  How  does  one  get  across  the  entire  network  of  inter- 
related concepts? 

When  an  advanced  cook  tutors  a beginner,  there  is  a tendency  to  lecture  at 
first,  describing  the  overall  family  of  sauces.  Then,  when  ihe  overview  descrip- 
tion has  been  completed,  the  beginner’s  lack  of  understanding  often  surfaces, 
causing  a fumbling,  exploratory  interaction  in  which  each  tries  to  understand 
what  the  other  is  thinking. 

Table  2 shows  a small  excerpt  from  such  a tutorial.  The  advanced  tutor  (T)  has 
finished  a twenty  minute  lecture  on  the  white  sauces.  He  concludes  by  saying; 
“and  1 think  that’s  all  there  is  to  say.”  The  student  (S)  has  been  following, 
making  appropriate  comments  along  the  way.  But  now,  unexpectedly,  the 
student  asks  a question  which  indicates  that  she  does  not  understand  the  overall 
patWn  of  sauces.  The  tutor  is  disturbed,  and  there  follows  a period  in  which  the 
tutor  tries  to  straighten  out  the  concepts.  This  is  a portion  of  that  conversation. 
It  starts  with  the  student  attempting  to  summarize  her  understanding  of  the 
sauces.5 

The  tutorial  shown  in  the  table  illustrates  a problem  with  reference.  If  the 
student  finds  a partially  correct  referent,  errors  may  go  undetected.  In  this  case, 
it  seems  that  the  student  originally  had  a concept  for  white  sauce  (the  standard 
American  white  sauce):  heat  together  butter  and  flour  and  then  add  milk.  This 
fits  easily  with  the  new  use  of  the  term  “white  sauce.”  But  french  white  sauces 
can  also  be  made  from  fish  or  chicken  stocks  without  milk  or  cream.  This  new 
procedure  does  not  fit  the  student’s  existing  memory  structures,  and  a good  deal 
of  confusion  ensued  until  the  tutor  was  able  to  straighten  it  out.  It  was  only 
when  the  discrepancy  between  her  previous  conception  of  white  sauce  and  the 
French  conception  was  explicitly  mentioned  that  she  began  to  make  sense  out  of 
her  lesson.  The  interesting  thing  about  this  tutorial  session  was  that  for  the  first 
20  minutes,  neither  tutor  nor  studtnt  realized  that  there  were  any  problems.  It 
came  as  somewhat  of  n shock  to  both  participants  to  realize  that  there  were  vast 
confusions,  and  the  entire  session  lasted  for  45  minutes  beyond  the  point  where 


5We  thark  J.  Lustig,  S.  Schane,  and  F.  Wightman  for  serving  as  tutors  and  tutees. 


9.  SCHEMATA  FOR  LEARNING  189 


TABLE  2 

Beginning  Student  and  Advanced  Cook  Tutorial" 


We  start  out  with  two  different  white 
sauces,  veloute  and  Bechamel-and 
p'  hamel  is  with-milk  did  you  say? 
Right. 

Veloute  is  with  egg  yolk  and  cream 
so  it’s  richer. 

Uh-no.  Bechamel  is  with  milk, 
veloute  is  with  stock.  That’s  the  basic 
difference. 

Oh  that’s  right.  We  made  the  veloute 
into  Parisienne. 

Either  the  Bechamel . . . 

OK. 

or  the  veloute  can  be  a Parisienne. 
What  American  cooks  mean  when 
they  say  a white  sauce  is-Bechamel. 
They  mean  a roux  with- 
Milk. 

Milk. 


take-a-sauce  made  with  stock  in- 
stead of  milk  and  cream,  and  add  egg 
yolks  1 — is  that  ever  done? 

That’s  a Parisienne. 

The  Parisienne  doesn’t  have  to  start 
with  the  veloute. 

The  veloute  is- 
Has-creajn  or  milk. 

No-no-the  veloute  is  the  roux  and 
the  stock. 

I keep-1  keep  mixing  that  up.  The 
veloute  is  stock -and  then  cream - 
The  Bechamel  is  the  milk-base, 
(pause)  And  from  both  of  those  you 
can  get  to  Parisienne  by  adding  egg 
yolks  and  cream. 

OK. 


(The  problem  here  is  that  the  student 
confused  her  knowledge  of  American 
white  sauce  with  the  related  but  dif- 
ferent concept  of  the  French  family 
of  white  sauces.  The  tutor  again  re- 
viewed the  concepts  involved  in  the 
sources.  The  student  tried  to  review:) 


(At  this  point  the  tutor  reviewed  the 
concepts.  The  student  summarized 
the  problem:) 


(long  pause)  OK.  (long  pause)  If  you 


I’m  confused  because  Bechamel  is 
what  I originally  learned  as  a white 
sauce  and  a white  sauce  is  a large 
class  with  all  these  different  kinds  of 
sauces. 

That’s  right. 


“Roux:  A mixture  of  flour  and  butter  that  serves  as  a thickening  agent  Lightly  cooked 
(for  white  sauces),  browned  to  make  brown  roux. 

Stock:  Clear  or  brown  liquid,  usually  made  from  chicken  or  fish  (white  stocks)  or  meat 
brown  stocks). 

Bechamel:  A basic  white  sauce  made  with  roux  and  milk. 

Veloute:  A basic  white  sauce  made  with  roux  and  white  stock. 

Enrichment:  The  addition  of  butter,  cream,  or  egg  yolks. 
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the  tutor  had  intended  to  end  the  session  by  saying,  “and  I think  that’s  all  there 
is  to  say.” 

This  section  shows  one  aspect  of  the  communication  between  teacher  and 
student:  the  difference  between  the  student’s  prior  conception  of  the  topic  and 
the  actual  one  to  be  learned.  This  prior  conception  guides  the  student  into  some 
erroneous  assumptions,  hindering  the  final  acquisition  of  the  concepts. 

Example:  The  jump  problem.  This  last  example  is  the  most  serious  of  the 
three  we  wish  to  describe.  What  we  want  to  discuss  is  how  a student  acquires  a 
schema  for  a concept,  discovers  places  where  it  does  not  operate  in  an  appro- 
priate fashion,  and  then  modifies  the  schema.  We  feel  that  the  most  important 
cognitive  structures  that  we  should  study  are  those  that  allow  the  discovery  of 
the  inappropriate  aspects  and  change  them.  We  will  say  more  about  this  later. 
Alas,  we  will  not  be  able  to  say  it  well  or  precisely. 

In  our  studies,  we  took  students  at  the  University  of  California,  San  Diego  who 
had  no  previous  experience  with  computers  and  put  them  in  front  of  a visual 
(television)  display  and  a typewriter  keyboard.  They  were  given  a series  of 
examples  in  programming  in  the  language  FLOW,  starting  off  with  some  basic 
principles.  At  times,  the  students  were  asked  to  type  a particular  program  into 
the  computer  and  then  to  predict  the  result  of  that  program.  After  the  student 
had  predicted  the  result  of  the  sample  program,  he  could  have  the  compute^ 
perform  the  program  arid  obser  e th“  result  displayed  on  the  screen  in  front  of 
him.  At  times,  we  asked  the  students  to  write  new  programs  that  would 
accomplish  some  goal. 

The  computer  language  is  very  simple.  For  the  aspects  illustrated  in  this 
chapter,  only  two  principles  need  to  be  understood: 

All  programs  consist  of  an  ordered  set  of  commands  (in  the  examples  here, 
most  programs  are  only  2 or  3 lines  long). 

Each  line  of  the  program  is  numbered  for  easy  reference. 

In  this  section  we  illustrate  some  of  the  problems  in  developing  pt  accurate 
schema  for  one  of  the  commands  of  the  language  FLOW.  We  follow  the  course 
of  one  student  who  is  learning  to  use  the  command  “Jump  to  . . . .”  She  has  just 
previously  learned  to  use  the  command  "Print”  in  several  different  programs, 
but  no  program  was  longer  than  two  lines  and  most  contained  only  a single  use 
of  the  “Print”  command.  In  this  section,  we  are  primarily  concerned  with  the 
examination  of  the  interaction  between  the  student’;  understanding  of  the 
concepts  as  represented  by  the  schemata  that  she  has  developed  and  the  program 
that  she  is  creating.  We  trace  the  development  of  the  schema  for  “Jump  to” 
through  successive  stages  of  experience  with  different  programs  that  result  from 
the  schema 

We  start  watching  the  student  at  the  point  where  she  has  been  asked  to  type 
the  following  program,  Program  1,  onto  the  display  terminal. 
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Program  1: 


010  Print  “Rochelle” 
020  Jump  to  010 


Experimenter:  Tim  program  will  make  the  computer  repeat  the  printing  of 
the  word  “Rochelle.”  What  do  you  think  the  output  will 
look  like? 

Student:  The  computer  will  print  the  word  “Rochelle”  twice. 

The  answer  is  consistent  with  the  ordinary  sense  of  the  word  “repeat.”  It  is  also 
consistent  with  the  student’s  prior  experience,  for  in  previous  programs  where 
there  were  no  Jump  statements  and  where  at  most  there  were  two  print 
statements,  any  program  that  repeated  *he  same  printout  printed  the  same  word 
twice.  If  we  could  characterize  this  student’s  schema  for  the  purpose  of  the 
“Jump  to”  instruction,  it  probably  would  look  something  like  this: 

Schema  1:  If  the  instruction  is  “Jump  to  n,"  then  the  computer  does 
instruction  number  n. 


Now  the  student  was  instructed  to  run  Program  1 . When  she  did  so,  the  output 
that  appeared  looked  like  this: 

RochelleRochelleRochelleRochelleRochelleRochelleRochelleRochelleRo  . . . 

Student:  I guess  it  keeps  repeating  until  someone  tells  it  to  stop. 

By  her  comment,  the  student  has  clearly  learned  something  more  about  the 
“Jump”  statement.  To  test  what  she  had  learned,  we  aksed  her  to  enter  a new 
program  into  the  computer  and  to  predict  its  outcome. 


Program  2:  010  Print  “Hi” 

020  Print  “Rochelle” 

030  Jump  to  010 

Experimenter:  What  do  you  think  this  program  will  do? 

Student:  Its  first  instruction  is  to  print  "Hi”  so  it  will  do  “Hi,”  then 
it  will  (pause)  there’s  no  space,  so  it  will  just  go 
“HiRochelle”  for  the  second  instruction.  And  then  it  will 
go  back  to  the  first  instruction  which  was  Print  “Hi,”  so  h 
will  just  write  “Hi”  until  we  tell  it  to  stop. 

We  see  from  this  example  that  the  student  has  modified  Schema  1 into  a new 
form,  something  like  this: 

Schema  2:  Do  each  instruction  in  order  unless  the  instruction  is  a JUMP- 
TO.  If  the  instruction  is  JUMP-TO  n,  then  continue  doing 
instruction  n until  told  to  stop. 


. -»«,  **■**„. 
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Note  that  this  schema,  even  thought  incorrect,  is  perfectly  consistent  with 
everything  the  student  has  seen  up  to  this  point.  She  has  derived  her  notion  of 
sequential  order  of  execution  from  earlier  programs  and  has  used  it  here  to 
predict  the  first  two  elements  of  the  output.  From  Program  1 she  ha*  seen  that 
the  Jump-to  in  that  program  caused  the  instruction  to  be  repeated.  Hence,  she 
developed  Schema  2. 

The  test  of  the  student’s  schema  came  when  she  was  asked  to  run  Program  2. 
Here  is  what  happened: 

HiRochelleHiRochelleHiRochelleHiRochelleHiRocheUeHiRochelle  . . . 

Once  again  the  result  was  not  what  was  expected.  Once  again  the  schema  for 
“Jump”  had  to  be  modified. 

Student:  When  you  say  jump  to  the  first  instruction,  it  will  go  to  that  and 
then  I guess  it  goes  to  the  second  one  and  if  there  isn’t  a second 
one  it  will  just  keep  repeating  the  first  one.  Otherwise  it  will 
repeat  both. 

This  is  a rather  complicated  and  highly  conditionalized  notion,  but  it  is  perfectly 
consistent  with  all  examples  she  has  seen.  When  she  was  asked  to  describe  how 
the  computer  actually  performed  Program  2,  she  provided  a correct  line  by  line 
description.  Her  schema  now  might  be  characterized  like  this: 

Schema  3:  Do  each  instruction  in  order  unless  the  instruction  is  “Jump  to.” 
If  the  instruction  is  “Jump-to  n”  then  begin  doing  instruc- 
tions at  number  n. 

If  there  are  no  more  instructions,  stop. 

Again,  we  tested  her  knowledge  by  asking  her  to  type  a specified  program  and  to 
predict  the  result: 

Program  3:  010  Jump-to  030 
020  Print ' Hi” 

030  Print  “Rochelle” 

Student:  The  computer  will  go  to  the  third  instruction  and  print 
“Rochelle”  then  to  the  second  and  print  “Hi”  an i then  to  the 
third  again  and  print  “Rochelle.” 

The  actual  result  is  this: 

Rochelle 

Only  the  one  word  is  printed,  and  then  the  program  halts,  Why  did  the  student 
predict  what  she  did,  when  according  to  Schema  3,  she  should  have  been  able  to 
predict  the  result  properly?  Evidently  she  has  other  schemata  about  the  opera- 
tion of  the  computer.  Many  students  seem  to  believe  that  every  statement  must 
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be  executed  at  least  once,  and  this  schema  could  apply  here.  If  so,  this  causes  a 
conflict  with  Schema  3,  which  might  possibly  be  resolved  by  a reversion  to  one 
of  the  earlier  schemata  for  “Jump.”  Whatever  the  reason,  it  was  a simple  matter 
for  the  student  to  modify  her  schema  for  “Jump."  When  she  saw  that  the  output 
was  the  single  word  “Rochelle,”  she  was  readily  able  to  determine  why: 

Student:  The  first  instruction  tells  it  to  go  on  to  the  third  and  then  there  is 
no  instruction  to  tell  it  what  to  do  so  it  stops. 

Now,  finally,  she  seems  to  have  a complete  and  correct  schema  for  the  Jump 
instruction.  When  given  two  more  tests,  she  predicted  the  results  correctly: 

Program  4:  010  Print  “Hi” 

020  Print  “Rochelle” 

030  Jump-to020 

The  predicted  and  correct  result  is: 

HiRochelleRochelleRochelle Roche UeRochelleRochelleRochelleRochelle  . . . 

This  shows  that  the  student  doesn’t  believe  that  each  repetition  needs  to  be  the 
same. 

Program  5:  010  Print  “Hi” 

020  Jump-toOlO 
030  Print  “Rochelle” 

The  predicted  and  correct  result  is 


This  program  shows  that  she  understands  that  not  every  line  need  be  followed. 

These  examples  point  out  the  ways  in  which  a student  formulates  hypotheses 
about  the  concepts  which  are  being  taught,  applies  those  hypotheses,  and 
modifies  them  when  necessary.  Learning  appears  to  be  organized  around  small, 
simple  schemata  that  can  be  applied  to  situations  wherever  deemed  appropriate. 
Part  of  the  task  we  must  face  is  to  *.  etermine  how  a person  comes  to  acquire, 
apply,  and  modify  these  schemata. 


Learning  by  Modification  of  Existing  Schemata 

To  transform  the  examples  of  the  use  of  schemata  that  we  have  presented  here 
into  a viable,  useful  theory  of  learning,  we  need  to  specify  with  more  precision 
just  what  it  is  that  takes  place  when  a schema  is  selected,  used,  and  then 
modified.  We  are  not  ready  to  report  much  infoimation  here,  but  we  can  tell 

you  of  some  related  work  that  seems  relevant. 

First,  consider  what  kinds  of  structure  a person  needs  in  order  to  be  able  to 
modify  his  schemata.  The  student  needs  to  be  able  to  compare  the  results 
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predicted  by  a given  schema  with  the  results  that  actually  occur.  Then  he  needs 
to  determine  the  points  of  mismatch,  and  then  correct  that  mismatch  Thus  he 

pr^“  with  some  ac“«  to  the  procedures  which  it  is  examining  and 
modifying.  This  is  no  simple  feat.  8 

The  problem  of  learning  through  errors  is,  of  course,  a well  known  one 

STlJST  and  b?  ed“cationtl  V°UP  at  the  Artificial  Intelligence  Labora- 

Crt  caUs  if^^n  3 8 f T °f  thlS  f°m  °f  ,Camin8-  “"W*  “ "hat 
rtpert  calls  it-the  process  of  eliminating  the  errors  or  bugs  in  one’s  own 

knowledge.  The  skills  of  debugging  are  clearly  important  ones.  Papert  believes  it 

is  perhaps  even  more  important  to  teach  a child  how  to  debug  his  own 

knowledge  than  to  teach  him  the  knowledge  itself.  The  implication  is  that  if  a 

child  loiows  how  to  learn,  then  he  can  get  the  knowledge  by  himself.  We  find 

that  this  philosophy  strikes  a sympathetic  chord:  Why  do  we  not  attempt  to 

each  some  basic  cognitive  skills  such  as  how  to  organize  one’s  knowledge,  how 

to  learn,  how  to  solve  problems,  how  to  correct  errors  in  understanding:  these 

stnke  us  as  basic  components  which  ought  to  be  taught  along  with  the  content 
matter. 

ilhrere.h.as  !rn  *°me  WOrk  “ the  Massachusetts  Institute  of  Technology 
Artificial  Intelligence  Laboratories  on  the  mechanisms  necessary  for  transform- 
ing schemata  that  were  in  error  into  better  ones.  One  of  the  best  known 
examples  of  this  work  was  performed  by  Winston  (1973).  Winston  showed  his 
system  figures  of  block  structures.  The  system  would  develop  network  represen- 
auons  .or  the  structures  (very  similar  in  form  to  the  structural  networks  of  the 
LNR  representation).  Winston’s  system  corrects  schemata  by  comparing  the 
representations  for  the  various  objects  and  noting  the  differences.  The  network 
get,  modified  according  to  the  nature  of  these  differences.  In  fact,  Winston 
shows  how  the  most  important  aspect  of  the  training  sequence  is  the  near  miss: 
the  appropriate  form  of  deviation  from  the  schema. 

fiHH°WfWithOUt  eXCCpti0n’  every°ne  we  have  ever  talked  to  who  is  either  in  the 

w t °«TJl,C.at'0n  u'  leaming  ***  Vi!y  angry  when  we  te"  them  of  Winston’s 
anv  elej  / * n°th,ngVe,y  :emarkabl'-' ” ‘hey  sputter,  “why  if  you  go  look  at 
any  elementary  education  text,  or  thr  work  of ...  ” etc.  We  agree,  but  disagree. 

Winstons  work  ,s  not  important  for  the  concept  of  the  near  miss  What  is 
important  is  that  he  has  managed  to  develop  a formal  procedure  for  representing 
certain  kinds  of  knowledge  and  then  for  changing  that  representation  when  it  is 
found  not  to  be  appiopriate.  It  is  one  of  the  most  sophisticated  examples  of  a 
teaming  program  of  which  we  are  aware. 

More  recently  at  The  Massachusetts  Institute  of  Technology,  Sussman  (1972) 
an  Goldstein  (1973)  have  taken  mother  step  closer  to  the  development  of 

D^ZStPf^»IrCS'  They  hfVe  managed  t0  devel°P  systems  that  can  correct 
programs,  find  the  errors,  and  modify  them  appropriately. 

l *r?uSti"  r0t  3!. thC  P0'"'  wherc  Psychological  models  can  be  developed 
‘ kf°rma)^lonj  of  schemata,  and  the  suggested  mechanisms  for 
modification,  but  we  believe  that  we  are  close. 
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SOME  GENERAL  COMMENTS 

Learning  involves  the  acquisition  of  new  cognitive  structures  built  upon  old, 
previously  acquired  structures.  Such  a statement,  by  itself,  says  little  that  has 
not  been  said  many  times  before.  What  we  believe  might  be  new  is  some 
understanding  of  the  underlying  representation  of  these  structures. 

The  overall  structure  for  a concept  matter  is  not  yet  known.  In  this  chapter  we 
have  discussed  some  extremely  simple  knowledge  frameworks:  for  verbs,  for 
simple  recipes,  and  at  the  level  of  verbal  description,  for  a programming  lan- 
guage. At  this  point  we  wish  to  speculate  upon  two  or  three  issues  relevant  to 
the  acquisition  and  use  of  these  schemata.  First,  some  comments  upon  theory. 
Second,  some  comments  on  what  should  (or  could)  be  taught.  Third,  some  brief 
comments  on  the  problem  of  communicating  the  relevent  atrocities  to  the 
student. 

On  theory  What  has  been  shown  here  is  very  incomplete.  It  certainly  is  a far 
distance  away  from  what  we  claimed  to  be  interested  in:  the  leamii.t  of  comples 
topics.  We  need  to  specify  how  we  interrelate  all  the  information.  W<  still  need  a 
lot  of  work  on  this  problem.  We  think  the  important  principle  is  that  the 
material  is  organized  around  schemata.  There  docs  not  seem  to  be  a homoge- 
neous network  structure.  Rather  there  seems  to  be  well  structured  means  for 
organizing  the  information,  and  for  functional  procedural  definitions.  Moreover, 
the  schemata  provide  means  for  applying  the  structures  to  new  and  to  unusual 
situations. 

On  what  might  one  teach  Two  general  principles  seem  applicable  . 

Introduce  the  general  framework  for  the  material  that  is  to  be  learned. 

Build  upon  the  general  knowledge  that  previously  existed. 

One  different  form  of  knowledge  that  we  believe  to  be  important  is  to  teach 
learning  skills: 

The  student  should  know  how  to  evaluate  and  modify  his  schemata. 

In  the  examples  shown  above,  it  has  been  importan  that  existing  schemata 
could  be  modified.  This  is  true  whenever  a schema  is  inappropriate  either 
because  one  simply  doesn’t  know  enough  or  because  ne  is  reasoning  by 
analogy. 

In  general,  one  has  to  know  how  to  understand  the  nature  of  this  reasoning 
process  to  be  effective  at  doing  it.  We  have  been  impressed  with  Papert  s 
teaching  of  “debugging  skills”-we  believe  he  is  correct  when  he  places  heavy 
emphasis  on  this. 

functional  reasoning  Collins,  Wamock,  Aiello,  and  Miller  (1975)  have 
demonstrated  some  nice  examples  of  what  they  called  “functional  reasoning, 
reason  in  which  knowledge  is  deduced  from  general  principles.  The  most  inter- 
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esting  cases  occur  when  the  person  doing  the  reasoning  is  not  aware  that  he 
knows  any  relevant  general  principles.  What  in  fact  happens,  we  believe,  is  two 
things.  First  there  is  something  akin  to  our  mayonnaise  example:  reasoning  by 
analogy.  Second,  comes  the  examination  of  several  examples  and  a generaliza- 
tion. A general  principle  can  be  considered  to  be  one  in  which  the  schema  has 
many  of  its  constant  terms  replaced  with  variables  (the  variables  will  have 
constraints  placed  upon  the  set  of  concepts  which  may  be  used  to  fill  them). 

The  process  by  which  one  takes  specific  knowledge  about  a particular  instance 
° 8 concept  or  01  an  “Penence  and  generalizes  it  to  apply  to  a larger  class  of 
experiences  is  one  that  needs  a good  deal  of  study.  One  suspects  that  this 
generalization  process  is  at  the  heart  of  much  of  our  everyday  operations  in 
which  new  situations  must  be  dealt  with  by  the  experience  gained  from  old  ones 
Learning  by  analogy,  learning  by  modification  of  existing  schemata,  the  use  and 
interpretation  of  metaphor,  and  functional  reasoning  would  all  appear  to  be 
related  examples  of  this  generalization  of  knowledge.  As  we  gain  in  our  under- 
standing of  the  structures  of  hunun  memory  and  in  the  ways  by  which  the 
knowledge  structures  are  acquired,  modified,  and  used,  we  will  come  to  enrich 
our  understanding  of  learning,  of  teaching,  and  of  the  human  use  of  knowledge. 
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INTRODUCTION 

The  task  of  developing  a psychology  of  instruction  is  formidable  because  we 
must  first  understand  the  nature  of  knowledge,  how  it  is  acquired,  under  what 
conditions  it  might  be  taught,  and  the  signs  by  which  its  attainment  might  be 
celebrated.  Of  course,  our  task  would  be  foredoomed  if  every  area  of  knowledge 
were  so  distinctive  in  its  requirements  on  the  human  mind  that  completely 
different  cognitive  processes  were  invoked  in  each  case.  If  so,  then  the  most  help 
educators  might  realistically  expect  from  psychologists  would  be  a pluralism  of 
principles  consisting  of  independent  sets  of  heuristic  tricks,  especially  tailored 
for  .ach  area  of  pedagogical  focus. 

Clearly,  the  working  hypothesis  which  best  serves  both  psychology  and  educa- 
tion  assumes  that  knowledge-gathering  processes  of  mind  are  essentially  the  same 
across  all  disciplines,  that  any  differences  will  be  in  detail  rather  than  principle. 
Let  us  then  begin  by  posing  the  central  problem  for  a cognitive  theory  of 
instruction  in  e way  that  presupposes  this  working  hypothesis:  what  is  the 
nature  of  the  general  cognitive  capacity  that  underlies  all  knowledge  acquisition? 
It  is  to  this  question  that  this  chapter  is  addressed. 
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THE  ABSTRACT  NATURE  OF  CONCEPTS 

A basic  characteristic  of  human  intelligence  is  the  ability  to  formulate  abstract 
conceptual  knowledge  about  objects  and  events.  Abstract  conceptual  knowledge 
is  exemplified  when  we  can  deal  appropriately  with  novel  instances  of  a concept, 
that  is,  when  our  knowledge  goes  beyond  just  those  instances  experienced. 

There  is  abundant  evidence  that  our  knowledge  of  language  must  be  abstract 
given  the  novelty  that  must  be  dealt  with.  Indeed,  the  role  of  novel  events  in 
language  has  long  been  recognized  by  linguists.  Sentences  are  almost  always 
novel  events.  To  verify  this  fact  you  need  only  pick  at  random  a sentence  in  a 
book  and  then  continue  through  the  book  until  the  sentence  is  repeated.  Unless 
you  have  picked  a cliche  or  a thematic  sentence,  it  is  unlikely  that  the  sentence 
will  reoccur.  We  readily  admit  that  most  sentences  are  novel,  but  what  about  the 
elements  from  which  sentences  are  constructed?  These  elements  must  be  the 
same  in  order  for  us  to  understand  sentences.  Further  examination,  however, 
shows  thai  words  too  are  typically  novel  events.  The  apparent  physical  sameness 
of  words  is  an  illusion  supported  by  the  use  of  printing  presses.  If  we  consider 
handwriting,  we  find  a great  deal  of  variation  in  the  construction  of  letters  and 
words.  The  novelty  of  words  becomes  even  more  clear  when  we  think  of  the 
same  word  spoken  by  different  speakers,  male  and  female,  child  and  adult,  or  by 
the  same  speaker  when  he  is  shouting  or  whispering.  Words,  like  sentences,  are 
typically  novel  events.  To  say  that  wor^s  are  novel  events  may  be  incorrect  in 
some  instances.  We  have  heard  our  friends  use  the  same  words  many  times,  our 
own  names  being  a case  in  point.  The  importance  of  the  argument  for  novelty  is 
to  illustrate  that  this  repetition  is  not  necessaiy  for  our  understanding  of  words; 
thus  our  ability  to  recognize  words  is  not  a function  of  having  experienced  that 
particular  physical  event  before. 

Greenburg  and  Jenkins  (1964)  demonstrate  an  even  more  striking  example  of 
the  capacity  to  deal  with  novel  instances  of  a class.  They  found  that  English 
speakers  could  deal  appropriately  with  novel  sequences  of  English  phonemes. 
Sequences  of  Phonemes  in  English  are  subject  to  powerful  constraints  described 
by  rule  structures  for  syllable  and  word  formation.  If  we  randomly  sample 
strings  of  English  phonemes  we  will  produce  three  types  of  strings:  strings  which 
are  actual  English  syllables  or  words;  strings  which  violate  the  rules  for  English 
syllables  and  word  construction  and  therefore  are  not  English  syllables  or  words; 
and  finally,  strings  which  are  in  accord  with  English  rule  structure,  but  are  not 
found  in  English.  Given  only  consonant-vowel-consonant  (CVC)  strings  we  all 
recognize  cat  as  an  actual  English  word  and  cah  as  clearly  not  an  English  word. 
However,  what  about  the  strings  dib  and  lutt?  Both  of  these  CVCs  are  in  accord 
with  English  rules  of  syllable  construction.  Dib  is  in  fact  an  actual  English  word. 
Consequently,  Greenberg  and  Jenkins  constructed  a measure  of  distance  from 
English,  based  upon  the  rules  of  English  syllable  construction,  which  accurately 
predicted  subjects’  judgments  of  novel  strings  of  phonemes.  The  subjects’  judg- 
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merits  about  novel  strings  of  phonemes  were  consistent  and  predictable  on  the 
basis  of  linguistic  rules  for  syllable  construction  in  English. 

This  research  clearly  demonstrates  that  the  knowledge,  on  the  basis  of  which 
English  speakers  recognize  and  construct  English  syllables  and  words,  is  abstract 
in  the  sense  that  it  is  not  knowledge  of  particular  physical  events,  but  rather 
knowledge  about  systems  of  abstract  relationships.  One’s  ability  to  recognize 
sequences  of  phonemes  not  experienced  before  as  acceptable  or  unacceptable 
English  strings  demonstrates  knowledge  of  rules  of  sequencing  of  phonemes,  that 
is,  abstract  conceptual  knowledge  which  allows  us  to  recognize  and  produce 
novel  events. 

But  phonemes  too  are  abstract  classes  of  events  which  cannot  be  specified  in 
terms  of  common  physical  elements.  Research  in  the  perception  of  speech  has 
shown  that  the  same  phonemes  are  specified  by  different  physical  events  in 
different  contexts  and  that  the  same  physical  event  can  specify  different  pho- 
nemes in  different  contexts  (Fant,  1964;  Libernvui,  Cooper,  Shankweiler,  & 
Studdert-Kennedy,  1967).  So,  with  phonemes  too,  the  basis  of  recognition  is 
knowledge  of  a code  or  system  of  relationships,  not  knowledge  of  particular 
physical  elements.  As  we  have  seen,  breaking  language  events  into  smaller  and 
smaller  elements  does  not  result  in  a level  of  analysis  based  upon  particular 
physical  elements.  Rather,  at  each  level  we  find  still  another  system  of  abstract 
relations  which  is  necessary  to  specify  the  nature  or  meaning  of  particular 
physical  events. 

Similarly,  we  are  able  to  recognize  a melody  played  on  a piano  even  though 
previously  we  have  only  experienced  instances  of  that  melody  played  on  other 
instruments,  or  by  an  orchestra,  or  even  hummed.  To  do  so,  therefore,  we  must 
have  an  abstract  concept  of  the  melody  that  specifies  the  isomorphism  existing 
among  the  various  instances.  Often  we  are  able  to  recognize  that  a painting  is 
executed  in  the  style  of  impressionism  or  by  a particular  artist,  say  Cezanne, 
even  though  we  have  never  seen  that  particular  work  before.  To  do  so  we  must 
have  an  abstract  concept  that  specifies  the  style  of  the  school  or  artist  such  that 
the  instances,  novel  ones  included,  are  seen  as  similar.  Thus,  there  seems  to  be 
ample  reason  to  conclude  that  concepts  are  not  necessarily  based  upon  knowl- 
edge of  particular  physical  events,  nor  upon  physical  units,  elements,  or  features, 
since  instances  of  many  concepts  are  only  abstractly  related. 


GENERATIVE  CONCEPTS 

Due  to  their  generality,  abstract  concepts  apply  to  a potentially  infinite  equiva- 
lence class  of  instances.  Howe'er,  this  fact  poses  a serious  problem  for  a 
cognitive  theory  bent  upon  explaining  how  they  are  acquired.  Since  one’s 
experience  is  with  but  a sample  of  the  entire  set  of  instances  to  which  such 
concepts  refer,  several  puzzling  questions  arise:  First,  how  can  experience  with  a 
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subset  of  objects  or  events  lead  to  knowledge  of  the  whole  set  to  which  it 
belongs?  There  is  a problem  of  explaining  how  some  part  of  a structure  can  be 
equal  to  the  whole  structure.  Indeed,  the  claim  that  some  can,  under  certain 
circumstances,  be  equivalent  to  all  seems  to  involve  a logical  contradiction.  That 
it  does  not,  fortunately  for  cognitive  theory,  can  be  amply  illustrated  in  many 
different  areas  of  conceptual  knowledge.  In  a moment,  we  will  illustrate  this  fact 
from  examples  drawn  from  three  distinct  fields-mathematics,  linguistics,  and 
perceptual  psychology. 

A second  crucial  question  that  must  be  answered,  given  a precise  answer  to  the 
first,  concerns  the  nature  of  the  subset  that  can  provide  the  knowledge  necessary 
to  deal  with  the  entire  set.  Will  just  any  subset  of  instances  do,  or  must  the 
subset  be  a certain  size  or  quality?  In  other  words,  how  do  instances  of  a 
concept  qualify  as  exemplary  cases  of  the  concept?  A precise  answer  to  this  iast 
question  has  quite  obviejs  implications  for  the  selection  of  effective  instruc- 
tional materia!  for  teaciiirg  concepts. 

Generative  concepts  in  mathematics.  In  mathematics  the  concept  of  an  infi- 
nite set  provides  a structure  for  which  it  is  true  that  a proper  subset  is  equal  to 
the  total  set.  Cantor  proposed  this  definition  of  the  infinite  when  lie  discovered 
that  a subset  of  all  natural  numbers,  such  as  the  even  integers,  can  be  placed  into 
a one-to-one  relationship  with  the  total  set  of  integers.  But  a more  relevant  case 
for  our  purposes  is  the  problem  of  providing  a precise  description  for  an  infinite 
chss  of  objects.  By  a precise  description  is  meant  a finite  specification  of  every 
instance  of  the  infinite  class. 

A moment’s  reflection  suffices  to  conclude  that  so-called  nominal  concepts  are 
quite  inadequate  for  this  purpose  since  it  is  impossible  to  ostensively  define  an 
infinite  class,  say  by  pointing  to  each  element.  Hence  the  label  “infinity”  could 
not  be  consistently  applied  since  finite  enumeration  will  not  discrimate  between 
classes  just  a little  larger  than  the  ostensive  count  and  ones  infinitely  larger. 

For  similar  reasons  so-called  attributive  concepts  of  infinite  classes  are  not 
possible  since  the  attempt  to  abstract  common  features  from  all  members  of 
such  classes  fails.  If  not  every  member  of  an  infinite  class  is  surveyed  by  a 
process  of  finite  abstraction,  then  a potentially  infinite  number  of  cases  may 
exist  which  fail  to  exibit  the  attribute  common  to  the  finite  subset  actually 
experienced.  Thus,  the  learning  of  concepts  that  refer  to  classes  with  a poten- 
tially infinite  number  of  members  such  as  trees,  people,  red  stars,  cannot  be 
adequately  explained  by  a cognitive  process  involving  finite  abstraction.  The 
process  of  abstraction  postulated  to  explain  the  acquisition  of  abstract  concepts 
must  work  in  some  other  way.  As  a mill  for  abstract  knowledge,  it  must  take  a 
finite  set  of  exemplars  as  grist  for  producing  concepts  of  infinite  extension. 

This  problem  has  perplexed  philosophers  for  many  centuries,  leading  some 
empiricists  and  nominalists  to  propose  that  in  fact  no  concept  of  an  infinite  class 
is  really  possible.  Their  argument  was  based  upon  the  belief  that  since  finite 
abstraction  is  the  means  by  which  all  concepts  are  formed,  then  the  concept  of 
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the  infinite  must  be  a negative  concept  referring  only  to  our  ignorance  regarding 
the  exact  size  of  a very  large  class  which  had  been  indeterminately  surveyed  by 
the  senses.  This  belief  constitutes  a refusal  to  recognize  the  creative  capacity  of 
human  intellignece  and  led  the  empiricists  to  a theory  of  knowledge  founded 
upon  associative  principles— principles  which  define  knowledge  as  nothing  more 
than  the  association  of  memoranda  of  past  sense  impressions— what  Dewey 
(1939)  rightfully  called  “dead”  ideas  because  they  cannot  grow. 

Infinite  structures  can  only  be  represented  by  finite  mea  .s  if  the  finite  means 
are  creative,  in  the  sense  that  a schema  exists  by  which  the  totality  of  the 
structure  can  be  specified  by  some  appropriate  finite  part  of  the  structure.  Such 
a schema  by  which  the  whole  can  be  generalized  from  an  appropriate  part  can  be 
called  a generative  principle,  while  the  appropriate  part  can  be  called  a generat- 
ing substructure  or  just  generator  for  short.  That  a structural  totality  can  be 
specified  by  a generator  plus  a set  of  generative  principles  can  usually  be  verified 
by  the  principle  of  mathematical  induction. 

Consider  the  problem  of  how  one  comes  to  know  the  concept  of  natural 
numbers.  Two  stages  seem  to  be  involved:  one  must  first  learn  the  set  of 
numerals  0,  1,  2, ....  9 as  well  as  a system  of  syntactic  rules  by  which  they  may 
be  concatenated  to  form  successively  ordered  pairs  (e.g.,  10,  1 1 , ....  99),  triples 
(e.g.,  100,  101, . . . 999),  etc.  The  number  of  numerical  strings  is,  of  course, 
potentially  infinite.  Hence  the  numeral  set  0,  1,  2, . . . , 9 constitutes  the 
generator  which  potentially  yields  all  possible  well-ordered  numerical  strings 
when  the  appropriate  generative  rules  of  the  grammar  are  applied. 

The  second  stage  in  acquiring  the  concept  of  natural  numbers  entails  inter  alia, 
not  only  knowledge  of  the  grammar  for  numerical  labels,  but  knowledge  of  the 
closure  of  arithmetic  operations  by  which  (a)  any  number  can  be  shown  to  be  a 
logical  product  of  an  arithmetic  operation  applied  to  a pair  of  numbers  e.g.,  1 + 
0=1, 1 + 1=  2,  1+2  = 3,...  ,and  <i)  any  logical  product  of  numbers  always 

yields  numbers. 

Indeed,  it  does  not  take  children  long  to  realize  that  any  combination  or 
permutation  of  the  members  of  the  generator  set  (0,  1 , 2, . . . , 9)  yield*  a valid 
number.  For  example,  is  9701  an  instance  of  ti.e  concept  of  natural  number?  Of 
course,  you  will  recognize  it  as  a valid  instance.  But  how  do  you  know?  Have 
you  ever  seen  this  number  before?  Does  it  matte*?  Unless  it  is  part  of  an  old 
phone  number,  address,  or  some  serial  number  that  you  have  frequently  dealt 
with,  then  you  probably  have  no  idea  whether  it  is  a familiar  or  novel  instance  of 
the  concept  of  natural  number.  Nevertheless,  one  knows  immediately  that  it  is 
an  instance,  presumably  because  one’s  knowledge  of  strings  of  numerals  is  as 
abstract  as  that  for  English  sentences. 

Generative  linguistic  knowledge.  A similar  line  of  argument  can  be  developed 
with  respect  to  the  best  way  to  characterize  a speaker’s  knowledge  of  his  native 
language.  The  problem  is'  “how  do  we  acquire  the  linguistic  competence  to 
comprehend  sentences  that  have  never  before  been  experienced?’’  For  instance, 


\ 


202  ROBERT  E.  SHAW  AND  BUFORD  E.  WILSON 


it  is  unlikely  that  you  have  ever  experienced  the  following  sentence:  the  impish 
monkey  climbed  upon  the  crystal  chandelier,  gingerly  peeled  the  crepes  from  the 
ceiling,  and  threw  them  at  the  furious  chef  This  fact,  however,  in  no  way 
diminishes  your  ability  to  recognize  it  as  a grammatical,  if  novel,  sentence. 

Whatever  the  precise  details,  it  seems  clear  that  the  child  acquire*  generative 
knowledge  of  his  language  from  limited  experience  with  a part  of  the  whole 
corpus  that  is  potentially  available.  Furthermore,  on  the  basis  of  this  limited 
experience,  he  is  able  to  extrapolate  knowledge  about  sentences  never  before 
experienced  by  him,  as  well  as  knowledge  about  those  never  before  experienced 
by  anyone. 

Presumably,  the  child’s  immediate  linguistic  environment  consisting  of  his  or 
her  family  and  local  aspects  of  his  culture,  provides  him  with  a generator  set  of 
exemplar)'  structures  from  which  he  educes  the  generative  principles  by  which 
all  other  sentences  are  known.  Chomsky  (1965)  has  argued  that  a transforma- 
tional grammar  provides  the  operations  defining  the  mapping  of  the  generator 
set  of  clear  case  utterances  onto  the  corpus  of  all  utterances;  other  theorists 
disagree.  However,  no  one  disputes  the  fact  that  the  acquisition  of  language 
requires  cognitive  schemata  that  are  truly  generative  in  nature. 

It  is  also  worth  noting  that  during  acquisition  specific  memory  for  sentences 
experienced  seems  to  play  no  necessary  role  in  the  process.  Several  lines  of 
research  support  this  contention.  Sachs  (1967)  demonstrated  that  subjects  were 
unable  to  recognize  syntactic  changes  in  sentences  that  did  not  change  their 
meaning  as  readily  as  they  were  able  to  detect  changes  in  meaning.  This  suggests 
that  people  often  do  not  remember  the  explicit  form  of  sentences  experienced. 
Other  researchers  (e.g.,  Blumenthal,  1967;  Mehler,  1963;  Miller,  1962»Rohrman, 
1968)  argue  that  rather  than  the  surface  structuic  of  sentences  being  remem- 

red,  it  is  the  deep  structural  relations  specified  by  current  transformational 
a ammars  that  characterize  the  abstract  conceptual  knowledge  retained  in 
memory. 

One  important  insight  that  emerges  from  a study  of  such  cases  is  that  for 
generative  concepts  there  are  no  truly  novel  instances.  There  are  only  those 
instances  that  are  actual,  because  they  belong  to  a generator  set,  and  those  that 
are  potential,  because  they  lie  dormant  among  the  remaining  totality  of  in- 
stance;. Consequently,  the  only  difference  between  actual  versus  potential 
instances  is  whether  the  instance  has  been  made  manifest  by  application  of  the 
generative  principle.  Once  done  so,  a newborn  instance  bears  no  marks  of  its 
recent  birth  to  denote  that  it  is  new  rather  than  old. 

If  the  above  reasoning  is  valid,  we  me  able  to  formulate  our  first  empirical 
hypothesis:  if  people  obtain  abstract  concepts  then  they  will  not  necessarily  be 
able  to  recognize  novel  instances  of  the  concept  as  being  novel;  that  is,  instances 
in  the  generator  set  of  it  concept  (Le.,  clear-case  exemplars)  will  not  always  be 
distinguishable  from  those  instances  never  before  experienced. 


. ..  ---- 
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In  the  next  section  we  review  some  of  the  research  recently  completed  at  the 

tivTmte.  °/ UmneS°‘a  Wh“ch  ,cnds  Plausibility  to  the  hypothesis  that  genera- 
systems  theory  provides  a precise  description  of  the  function  of  the 
cognitive  capacity  by  which  we  obtain  abstract  concepts. 


EXPERIMENTS  ON  GENERATIVE  CONCEPTUAL  SYSTEMS 

The  problem  of  how  people  learn  abstract  conceptual  systems  is  by  no  means 
new  to  Psychology  Sir  Fredric  Bartlett  (1932),  in  his  classic  book  Remember- 
mg  realized  that  what  people  learn  must  be  some  kind  of  an  abstract  system  or 
schema  rather  than  a discursive  list  of  simple  instances.  Clearly  concepts  can  be 
amed  from  a small  set  of  very  special  instances,  what  might  be  called  proto- 
types or  exemplars  of  the  concept.  Considerable  research  has  shown  this  to  be 
tne  case  However,  in  doing  so  some  curious  results  were  uncovered.  Further- 

"*5?  t0  CharaCter,Ze  preCiSCly  the  natUre  of  P'°t°typic  instances 
sufficient  for  the  learning  of  a given  concept  proved  more  elusive  than  expected. 

Attneave  (1957a)  demonstrated  that  experience  with  a prototype  facilitated 
paired-associate  learning  involving  other  instances  of  the  concept.  In  related 
research  Posner  and  Keele  (1968)  found  that  subjects  were  able  to  classify 
correctly  novel  dot  patterns  as  a result  of  experience  with  classes  to  patterns 
htch  were  abstractly  related  to  the  novel  instances,  i.e.,  related  by  statistical 
rules  rather  than  by  feature  similarity.  Later,  Posner  and  Keele  (1970)  isolated 
e following  properties  of  the  conceptual  systems  which  enabled  subjects  to 
classify  novel  instances  of  the  classes  of  dot  patterns:  (1)  this  conceptual  system 
Hth  ab$tracted  durin8  lnit,al  experience  with  the  classes  of  patterns,  and  (2) 
t from  experience  with  patterns,  it  was  not  based  upon  stored 
th,  Th  ??  Pk-  temS'  nC  WCek  3fter  the  ori«inal  experience  with  the  patterns, 

while  d hSlty  t0  f aSfy  thC  pattemS  actually  seen  earlier  had  decreased, 
whde  the  r ability  to  classify  “new”  prototypic  patterns  surprisingly  had  not. 

Th  s result  supports  Bartlett’s  view  by  strongly  suggesting  that  these  “new” 
instaices  were  classified  in  terms  of  a highly  integrated  system  of  abstract 

Ilillw*  K°nCeP  SyStem)  rather  than  beinB  mediated  at  the  time  of 
classification  by  memory  of  individual  patterns. 

The  question  then  is:  “how  can  a subset  of  instances  of  a class  be  used  to 

S th^C  CntlrCi  dr?”  °"e  3VenUe  'hM  WC  are  investi«ating  is  to  see  what 
. ght  the  concept  of  group  generator  may  give  into  the  generative  nature  of 
conceptual  systems. 

The  notion  of  a group  generator  can  be  understood  intuitively  by  carefully 
u ymg  the  illustrations  of  the  generator  and  nongenerator  sets  of  stimuli  used 
m the  experiment  reported  below,  page  207.  One  should  notice  that  the 
generator  set  consists  of  cards  whose  relations  define  the  displacements  figures 
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undergo  when  orbiting  around  the  center  of  the  card,  that  is,  when  the  ordered 
sequence  of  cards  specifies  orbiting.  On  the  other  hand,  the  nongenerator  set  of 
stimuli  consists  of  cards  that  are  physically  similar  to  those  in  the  generator  set, 
differing,  however,  in  that  no  sequence  of  these  cards  is  sufficient  to  specify 
the  orbiting  concept.  At  most  they  specify  a displacement  of  four  figures  over 
the  diagonal  path  running  from  the  upper  left  to  the  lower  right  hand  comers  of 
the  card. 

In  the  next  section  a more  formalized  account  of  the  group  generator  notion  is 
presented. 

The  Concept  of  Group  Generator 

Many  examples  of  the  generative  property  of  mathematical  groups  exist.  For 
instance,  for  each  integer  n,  it  is  possible  to  construct  a group  having  exactly  n 

element  (a  group  of  order  n)  by  considering  l,  a,  a1,  a3 an~  l,an,  where 

a0  = aK  = 1 and  the  operation  is  ordinary  algebraic  multiplication.  Such  a group 
is  called  cyclic  because  the  initial  element  (a0)  is  identical  to  the  terminal 
element  (a  ),  the  symbol  a is  called  a generator  of  the  group,  since  every  group 
element  is  a power  of  a,  that  is,  a X a =a2 , a X a3  =a3 aXa""1  =a"=a°. 

The  (integer)  representation  of  the  concept  of  a group  with  a generator  is  but 
one  application  of  this  abstract  system.  As  another  example,  consider  the 
rotational  (cyclical)  symmetry  of  a square.  Let  each  vertex  of  the  square  be 
labeled  (1,  2,  3,  4.)  and  represented  as  the  bottom  row  of  a matrix.  Then  let 
each  position  initially  occupied  by  these  vertices  be  similarly  labeled  and 
represented  in  the  top  row  of  the  same  matrix: 

positions  (P)  (1  2 3 4) 

vertices  (V)  (1  2 3 4) 

We  now  define  a 90°  clockwise  rotation  of  the  square  as  follows: 

/*  2 3 4 \ 90°  /I  2 3 4 \ 

\ I 2 3 4 / \4 I 2 3/ 

The  360  rotation  of  the  square  can  be  similarly  represented  is  four  90° 
rotations: 

I 1! 

/I  2 3 4 \ 90° / 1 2 3 4\I80°/1 
\l  2 3 4/  " U 1 2 3/  \3 


111 
2 3 4' 
4 1 2; 


IV 

\270°/l  2 3 4\360°  / 1 2 3 4\ 
/ \ 2 3 4 1/  \I  2 3 4/ 


1 
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The  configurations  I — IV  are  the  group  elements  representing  all  the  possible 
configurations  of  a square  that  can  be  generated  by  a pr  jduct  of  90°  rotations. 
This  can  be  summarized  in  tabular  form  as  follows: 


I 

II 

III 

IV 


I II  III  IV 


I II  III  IV 

II  III  IV  1 
HI  IV  I II 
IV  1 II  III 


From  inspection  of  the  table  it  is  clear  that  I is  the  identity  element  and  that 
every  element  has  an  inverse,  e.g.,  II  X IV  = I,  111  X III  =1,  etc.  To  illustrate  the 
group  operation,  (X),  by  which  these  products  of  rotations  in  the  table  above 
were  computed,  consider  the  way  in  which  one  proves  that  the  element  IV  is  the 
inverse  of  element  II  since  their  product  II  X IV  yields  the  identity  element  I (a 
0°  or  360°  rotation). 


II 

IV 

I 

/' 2 3 4\.  / 

1 2 3 4 \ 

/,  2 3 4\ 

\4  1 2 3/  ' \ 

2 3 4 1 / 

\l  2 3 4/ 

In  general,  to  multiply  one  arn  y by  another  do  the  following:  Replace  the  value 
of  the  vertex  in  a given  position  in  the  first  array  with  the  value  of  the  vertex 
found  under  the  position  with  the  corresponding  value  in  the  second  array.  For 
instance,  in  the  above  example,  II  X IV  = I,  the  products  are  computed  as 
follows:  VA  in  P,  of  II  is  replaced  by  K,  in  PA  of  IV;  Vt  in  P7  of  II  is  replaced 
by  Vj  in  Pt  of  IV,  etc.,  where  V(  and  P / denote  the  ar"  te  vertex  and 
position. 

More  importantly  (for  our  purposes),  the  group  of  rot.  ms  f<  square  has 
two  generators,  namely  II  and  IV.  Either  of  these,  if  m leratively  by 

itself,  yields  all  elements  of  the  group.  Thus,  II1  = III,  II3  = IV,  II4  = I,  IIs  = II 
and  similarly,  IV"  yields  III,  II,  I,  IV,  respectively.  This  generative  property  is 
not  trivial  since  neither  I nor  III  are  generators  of  the  group;  I"  = I since  it  is  the 
identity  and  III"  alternates  between  i and  111,  never  producing  II  or  IV  becurse 
III  is  its  own  inverse.  Many  other  groups  have  nontrivial  generators.  A most 
important  group  is  that  of  perspectives  of  solid  objects.  The  fact  that,  for  many 
objects,  a few  perspectives  provide  sufficient  information  to  specify  their  total 
shape  suggests  a way  in  which  perceptual  systems,  like  conceptual  ones,  may  te 
generative.  (Shaw,  McIntyre,  & Mace,  1974) 
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The  basic  strategy  for  testing  the  applicability  of  the  group-generator  descrip- 
tion in  explaining  generative  conceptual  systems  is  to  construct  acquisition  sets 
which  either  are  or  are  not  generators  specifying  the  total  class  of  instances 
referred  to  by  the  concept.  This  suggests  the  following  hypothesis: 

If  the  information  specified  in  the  group  generator  acquisition  set  is 
sufficient  to  allow  subjects  to  generate  the  entire  class,  subjects  should 
then  treat  novel  instances  of  the  class  in  a fashion  similar  to  the  way  they 
treat  experienced  instances  of  the  class.  In  contrast,  the  subjects  who  are 
given  a non-generator  acquisition  set  should  treat  experienced  and  new 
instances  of  the  class  differently. 

A Generative  Concept  Experiment 

To  investigate  the  above  hypothesis  Wilson,  Wellman,  and  Shaw  constructed  a 
system  consisting  of  four,  simple  geometric  figures  (a  cross,  a heart,  a circle,  and 
a square)  orbiting  alone  through  the  four  corner  positions  of  a square  card.  This 
allows  for  the  construction  of  sixteen  distinct  stimuli  (i.e.,  four  figures  X four 
positions  = 16  cards).  These  sixteen  cards  provide  the  underlying  set  over  which 
the  concept  of  orbiting  can  be  defined  by  an  appropriate  ordering  of  the  cards. 
Moreover,  the  system  of  relationships  among  the  cards  determined  by  the 
discrete  orbiting  of  the  figures,  logically  specify  a group  of  transformations 
(displacements)  that  is  isomorphic  with  the  geometric  group  of  90°  rotations  of 
the  square  discussed  earlier.  By  definition  two  specific  groups  (e.g.,  the  orbiting 
and  rotation  groups  denoted  above)  are  abstractly  equivalent  if  some  third  group 
can  be  found  to  represent  each.  The  numeric  arrays,  I-V  with  the  operation  X, 
constitute  such  a group. 

The  sixteen  cards  which  provide  the  underlying  set  for  the  “orbiting”  group 
can  be  represented  by  the  numeric  arrays  I— IV  as  follows:  let  the  top  row  of  the 
array  specify  the  corner  positions  on  a stimulus  card  while  the  bottom  row 
specifies  the  figures  that  occur  in  those  positions.  In  this  fashion,  the  columns  of 
the  arrays  HV,  reading  from  left  to  right,  denote  all  sixteen  cards  in  the  set 
underlying  the  concept  of  orbiting.  In  the  rotation  case,  each  relationship 
between  adjacent  arrays  specifies  the  new  positions  assumed  by  the  vertices  of 
the  square  as  it  rotates  discretely  through  90°;  by  contrast,  in  the  orbiting  case, 
the  relationship  between  adjacent  arrays  now  provides  a summary  of  the  new 
positions  assumed  by  each  of  the  orbiting  figures  frcm  card  to  card.  In  other 
words,  the  orbiting  of  a figure  can  be  thought  of  as  a rotation  around  an  axis 
point  outside  the  figure.  Hence  they  have  the  same  group  multiplication  table 
and  are  abstractly  equivalent. 

The  sequence  of  cards  specifying  a generator  for  the  16-card  set  used  in  the 
acquisition  phase  of  the  experiment  for  one  group  of  subjects  is  shown  in  Fig.  1 
Notice  that  the  first  four  cards  constitute  the  columns  of  array  I while  the 
second  four  cards  constitute  the  columns  of  array  II.  To  see  that  these  eight 
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FIG.  1 Group  generator  acquisition  let 


2 3** 


7 6 


cards  qualify  as  a generator  for  .e  total  set  of  cards  one  need  only  multiply 
them  together  in  the  iterative  iashion  as  discussed  earlier.  By  consulting  the 
group  multiplication  table  one  immediately  verifies  that  multiplying  array  11  (or 
IV)  by  itself  a sufficient  number  of  times  yields  all  the  arrays,  1— IV.  and 
therefore,  is  a generator  for  the  total  set  of  cards.  Also  try  consulting  the  table, 
one  can  verify  that  iterative  multiplication  of  array  III  by  itself  yields  only  I and 
III  and,  therefore,  does  not  qualify  as  a generator  for  the  group  of  sixteen  cards. 

There  is  a sense,  however,  in  which  III  is  a generator  that  specifies  an  abstract 
concept;  namely,  since  the  geometric  figures  occur  in  all  four  positions  across 
cards,  if  they  are  treated  equivalently,  then  they  do  specify  the  entire  set.  In 
order  to  minimize  the  degree  of  abstraction  (i.e.,  generality)  of  the  nongenerator 
acquisition  set,  eight  cards  were  selected  in  which  the  figures  occurred  in  only 
two  positions,  rather  than  four  positions  specified  by  III.  This  selection  guaran- 
teed that  the  nongenerator  acquisition  set  could  not  specify  the  entire  concept 
(set)  at  any  level  of  abstraction.  (Although  it  does  contain  the  generator  for  a 
system  of  diagonal  relationships.)  The  cards  in  this  new  nongenerator  acquisition 
set  used  in  the  experiment  are  shown  in  Fig.  2. 

During  recognition  both  groups  were  shown  the  eight  cards  they  had  experi- 
enced during  acquisition  plus  the  remaining  eight  novtl  instances  of  the  system. 
Additionally,  both  groups  were  shown  nine  cards  which  did  not  fit  the  system, 
that  is,  noncases.  The  noncases  were  constructed  by  using  inappropriately 
colored  geometric  forms,  forms  occurring  in  the  center  of  the  card,  and  forms 
which  were  oriented  differently  on  the  card  than  those  in  the  system,  for 
example,  a 45°  rotation  from  the  perpendicular.  The  recognition  set,  therefore, 
consisted  of  25  cards,  8 “old”  cards,  8 “new”  but  appropriate  cards  and  9 
“noncases.”  Subjects  were  shown  each  of  the  25  cards  one  at  a time  and  asked 
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FIG.  2 Nongenerator  acquisition  set. 
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to  rate  each  card  as  “old”  or  “new,”  that  is,  as  one  they  had  seen  duiing 
acquisition. 

As  might  be  expected  both  groups  consistently  rated  the  old  items  as  old  and 
identified  the  noncases  as  new  cards.  The  generator  acquisition  set  subjects  rated 
old  cards  as  old  on  80%  of  the  cases  and  the  noncases  as  new  on  99%  of  the 
cases. 

The  two  groups  were  strikingly  different,  however,  in  their  judgments  of  the 
new  but  “appropriate  within-the-system”  cards.  The  “nongroup  generator” 
subjects  correctly  identified  these  new  instances  as  new  on  more  than  90%  of  the 
cases.  In  marked  contrast,  the  “group  gener-tor”  subjects  rated  the  new  cards  as 
being  old  50%  of  the  time.  That  is,  their  judj„nents  of  the  new  but  appropriate 
instances  were  at  a chance  level  On  50%  of  their  judgments,  subjects  identified 
the  novel  instances  of  the  system  as  cards  which  they  had  experienced  during 
acquisition.  This  group  could  cleariy  discriminate  system  from  nonsystem  cards, 
as  shown  by  their  rejection  of  the  noncases;  but  they  could  not  consistently 
discriminate  experienced  instances  of  the  system  from  novel  ones. 

Two  conclusions  can  be  drawn  from  these  results: 

1.  During  acquisition  subjects  are  acquiring  information  about  the  abstract 
relations  existing  between  the  items  in  the  acquisition  set.  That  is,  they  are 
gaining  more  information  than  can  be  characterized  by  copies  of  the  individual 
cards  they  experienced. 

2.  The  information  specified  by  the  group  generator  is  sufficient  to  allow 
subjects^  to  generate  the  entire  system.  This  supports  the  claim  that  these 
subjects  knowledge  of  the  system  of  orbiting  cards  is  indeed  generative. 

The  fact  that  subjects  in  the  generator  group  could  not  consistently  dis- 
criminate between  previously  experienced  and  novel  instances  of  the  system, 
strongly  suggests  that  subjects  are  acquiring  an  abstract  relational  concept  which 
defines  a class  of  events,  not  simply  information  about  tba  specific  instances 
they  had  experienced.  Furthermore,  this  result  also  suggests  that  these  subjects 
acquired  a knowledge  of  an  event  (the  orbiting  of  cards)  that  is  truly  generative. 
(More  about  this  type  of  event  conception  will  be  said  in  the  next  section.) 

Assuming  that  subjects  are  acquiring  abstract  relational  systems  from  experi- 
ence with  the  generator  acquisition  set  rather  than  specific  memory  of  experi- 
enced instances,  the  question  arises  as  to  the  effect  of  more  experience  with  the 
acquisition  set.  Conceptions  of  memory  based  upon  the  abstraction  of  static 
features,  or  copies  of  the  experience  events,  would  predict  that  more  experience 
with  the  acquisition  set  would  facilitate  subjects’  recognition  of  new  instances  of 
the  system  as  actually  new,  that  is,  as  not  before  experienced.  If,  instead  of 
storing  copies  of  the  experienced  instances  or  abstracting  the  common  attributes 
of  the  instances,  subjects  are  acquiring  information  about  the  abstract  relations 
among  these  instances  in  the  system,  more  experience  with  the  acquisition  set 
would  not  necessarily  result  in  an  increased  ability  to  recognize  new  instances  of 
the  system  as  being  novel.  As  subjects  better  acquire  the  abstract  relational 
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system  they  would  be  moie  able  to  discriminate  instances  of  the  system  as  being 
novel.  As  subjects  better  acquire  the  abstract  relational  system  they  would  be 
more  able  to  discriminate  instances  of  the  concept  from  noncases.  However,  the 
novel  instances  of  the  system  may  be  more  difficult  to  discriminate  as  new 
precisely  because  they  are  instances  of  an  abstract  relational  system 
To  investigate  this  possibility,  four  additional  groups  were  run,  two  with  each 
of  the  two  types  of  acquisition  sets.  One  group  experienced  the  generator 
acquisition  set  twice,  and  a second  group  three  times.  Similarly,  a third  experi- 
enced the  nongenerator  acquisition  set  twice  and  the  fourth  group  experienced  it 
three  times.  Following  the  acquisition  phase,  all  groups  were  tested  for  recogni- 
tion. 

In  the  nongenerator  groups,  the  greater  amount  of  experience  with  the 
acquisition  set  resulted  in  an  increased  ability  to  recognize  the  new  instances  of 
the  system  as  new.  The  subjects  who  experienced  the  acquisition  set  three  times 
were  able  to  recognize  the  new  instances  as  new  on  100%  of  the  cases.  The 
nongenerator  subjects  were  able  to  consistently  identify  new  instances  of  the 
concept  as  new  after  one  presentation  of  the  acquisition  set,  and  the  subjects 
given  more  experience  with  tile  nongenerator  acquisition  were  even  more  accu- 
rate in  this  discrimination.  However,  the  results  obtained  with  subjects  who 
experienced  the  group -generator  acquisition  set  were  quite  different.  Not  only 
were  these  subjects  unable  to  identify  novel  instances  of  the  system  as  new,  but 
additional  experience  wiih  the  acquisition  set  decreased  the  subjects’  ability  to 
recognize  new  instances  as  being  new.  As  stated  earlier,  the  subjects  who 
experienced  the  acquisition  once  accepted  the  new  instances  as  old  50%  of  the 
time.  Subjects  who  experienced  the  acquisition  set  twice  before  recognition 
identified  the  new  instances  as  old  on  75%  of  their  judgments,  and  the  subjects 
who  experienced  the  generator  acquisition  set  three  times  identified  the  new  but 
appropriate  instances  of  the  system  as  old  on  over  80%  of  their  judgments.  All  of 
these  subjects  continued  to  correctly  recognize  the  old  instances  as  old  and 
reject  the  noncases  as  not  before  experienced. 

These  results  provide  strong  evidence  that  subjects  are  acquiring  information 
about  an  abstract  system  of  relations  and  not  simply  information  about  the 
static  properties  or  attributes  of  the  experienced  instances.  If  subjects’  judg- 
ments were  based  solely  upon  the  attributes  or  static  features  of  the  experienced 
instances,  the  subjects  would  be  able  to  recognize  new  but  appropriate  instances 
as  being  new  and  increase  experience  should  enhance  this  recognition.  As  we 
have  seen  the  results  were  not  obtained.  On  the  other  hand,  if  subjects  are 
acquiring  a generative  conceptual  system,  then,  instances  which  are  appropriate 
to  the  system  would  be  recognized  as  familiar.  As  the  abstract  conceptual  system 
is  better  learned  the  subjects’  would  be  more  likely  to  recognize  novel  instances 
of  the  system  as  belonging  to  the  system  and,  therefore,  identify  them  as  old. 

It  should  be  noted  that  these  data  provide  strong  support  for  our  hypothesis, 
namely,  that  knowledge  of  a subset  of  the  instances  of  a concept  was  in  fact 
tantamount  to  knowledge  of  all  instances  of  the  concept.  When  the  system  was 
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well  learned,  subjects  could  not  dist'ngui&h  old  from  novel  instances.  Clearly, 
experience  with  the  group-generator  acquisition  set  was  in  this  case  tantamount 
to  experience  with  *»<  entire  system. 

Finally,  it  shoulc  be  noted  that  in  these  experiments  subjects  were  not 
instructed  to  find  reladons  between  the  individual  instances,  nor  were  they  told 
that  they  would  be  tested  for  recognition.  Rather,  subjects  were  instructed  that 
we  were  studying  short-term  memory  of  geometric  forms.  Their  task  was  to 
reproduce,  by  drawing  each  card  in  the  appropriate  acquisition  set  after  per- 
forming an  interfering  task.  In  this  case,  the  abstraction  of  the  systematic 
relations  between  instances  of  the  system  appears  to  be  automatic  in  the  sense 
that  it  was  not  intentional. 


EVENT  CONCEPTS  AS  GENERATIVE  KNOWLEDGE 

In  this  section  we  present  evidence  in  support  of  the  contention  that  event 
concepts  are  abstract  in  nature  and  therefore  generative. 

Shaw,  Mclnty.e,  and  Mace  (1974)  argue  that  perceiving  the  nature  of  events 
involved  the  detection  of  sufficient  information  to  specify  their  affordance 
structure.  The  term  “affordance”  is  borrowed  from  James  Gibson  (1966)  and 
refers  to  the  invariant  perceptual  information  made  available  by  objects  and  events 
that  specifies  how  animals  and  humans  might  adapt  to  their  environments. 

The  affordance  structure  of  events  consists  of  two  necessary  components:  the 
transformation  over  which  the  event  is  defined  (the  transformational  invariant) 
and  the  structures  which  undergo  the  change  wrought  by  the  application  of  the 
hansformation  (the  structural  invariant).  The  transformational  invariant  must  be 
perceptually  specified  in  the  acquisition  set  if  the  dynamic  aspects  of  the  event 
are  to  be  identified  (e.g.,  that  the  event  is  ofx  running,  rolling,  growing,  smiling, 
etc.),  while  the  structural  invariant  must  be  perceptually  specified  if  the  subject 
of  the  event  is  to  be  identified  (e.g.,  what  x is:  John  runs,  the  ball  rolls,  the 
flower  gtows,  Mary  smiles,  etc.).1  A set  of  instances  of  an  event  is  not  an 

Perhaps  a better  way  to  clarify  the  difference  between  structural  and  transformational 
invariants  is  as  follows:  given  that  Jo*r.  runs,  John  walks,  John  smiles,  John  loves,  the 
subject  of  all  these  events  is  John;  me  subject’s  structure  is  what  is  common  or  invariant 
and,  hence,  is  the  structural  invartuit  of  all  the  events  denoted  On  the  other  hand,  given  the 
following  events:  John  runs.  Bill  -jns,  Mary  runs,  Jill  runs,  then  there  is  no  commo..  subject. 
All  we  know  is  that  some  objec  with  a minimal  structure  to  support  the  operation  to  run  is 
involved.  The  operation  on  the  minimal  structure  x is  the  transformational  invariant.  But 
note  that  even  here  to  define  that  transformation  presupposes  some  minimal  structural 
invariant  at  its  necessary  support.  A similar  argument  for  the  necessity  of  postulating 
minimal  transformational  Invariants  ir.  order  to  define  structures  can  also  be  given.  For 
these  reasons  the  affordance  structure  of  events,  inclusive  of  actions  and  objects,  necessarily 
requires  both  structural  and  transformational  invariants  for  its  definition.  Since  in  ecological 
science  there  are  only  affordance  structures,  that  is,  animal-environment,  or  subject-object 
relations,  the  affordance  concept  is  a universal  semantic  primitive  that  deserves  careful  study 
by  cognitive  psy chologis ts. 
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exemplify  set  and,  therefore,  doe*  not  constitute  a generator  *et  for  the  event,  if 
it  fails  to  provide  perceptual  information  sufficient  to  specify  both  the  transfor- 
mational and  structural  invariants.  To  summarize  this  hypothesis: 

All  necessary  conditions  being  satisfied,  a person  will  acquln  • the  concept 
of  an  event  when  presented  with  an  acquisition  set  of  exemplary  in- 
stances (a  generator  set)  because  such  a set  provides  the  minimal  per- 
ceptual information  sufficient  to  specify  the  affordance  structure  of  the 
event. 

In  the  experiment  discussed  in  the  previous  section  we  showed  how  a certain 
subset  of  object  configurations  qualified  both  formally  and  psychologically  as 
the  generator  set  for  an  event  concept,  that  of  an  “orbiting"  event  defined  over 
geometric  forms.  Thus,  the  group  generator  description  does  seem  to  offer  a 
viable  means  of  making  explicit  the  manner  in  which  abstract  conceptual 
systems  may  be  creative. 

The  abstract  concept  derived  from  perceiving  the  orbiting  of  the  stimulus 
figures  can  be  analyzed  as  follows:  The  generator  set  in  the  acquisition  phase  of 
the  experiment  consisted  of  stimulus  configurations  sufficient  to  specify  a 
subgroup  of  the  displacement  group,  namely,  the  orbiting  group.  This  set  of 
stimuli  constituted  the  structural  invariant  of  the  event  while  the  group  opera- 
tion (orbiting)  constituted  the  transformational  invariant  of  the  event.  The 
subjects  succeeded  in  obtaining  the  concept  of  this  event  by  detecting  these  two 
invariants  which  taken  together  constitute  the  affordance  structure  of  the  event. 

The  orbiting  group  itself  provides  ?.  description  of  the  relevant  aspects  of  the 
abstract  concept  of  the  event.  Thus  construed,  the  perceived  meaning  of  the 
event  is  the  orbiting  group  interpreted  over  the  stimulus  structures  presented. 
Consequently,  we  see  no  way  or  reason  to  avoid  the  conclusion  that  in  all  event 
perception  situations  Lhe  existence  of  an  abstract  concept  is  entailed.  Under  this 
view,  the  generator  for  the  abstract  event  concept  is  that  set  of  instances  which 
conveys  sufficient  perceptual  information  to  specify  both  the  transformational 
and  structural  invariants  defining  the  went.  The  meaning  of  this  invariant 
information  for  the  human  or  animal  perceiver  is  the  affordance  structure  of  the 
event. 

In  our  opinion,  this  analysis  argues  in  favor  of  the  hypothesis  that  perception 
is  a direct  apprehension  of  the  meaning  of  events  insofar  as  their  affordance 
structure  is  concerned.  Since  abstract  concepts  are  generatively  specified  by  (i.e., 
abstractly  equivalent  to)  their  exemplary  instances  (generator  set),  their  acquisi- 
tion can  also  be  considered  direct,  requiring  no  augmentation  by  voluntary 
inferential  processes.  Similarly,  no  constructive  cognitive  process  need  be  pos- 
tulated to  explain  how  abstract  concepts  are  built  up  out  of  elementary 
constituents  as  argued  by  the  British  Empiricists  since  such  elementary  con- 
stituents play  no  necessary  role  in  the  definition  of  the  concept. 

Have  we  made  too  much  of  the  apparent  success  of  the  generative  systems 
approach  in  a single  line  of  experiments?  It  is  important  to  ask  whether  the  same 
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analysis  can  be  applied  to  a variety  of  experimental  phenomena.  We  explore  this 
possibility  in  the  next  section. 


Perceiving  the  Affordance  Structure  of  Elabor^e  Events 

So  far  we  have  presented  an  example  involving  an  event  whose  affordance 
structure  can  be  formally  described  by  very  simple  group  stmetures,  i.e.,  orbit- 
ing. We  would  now  like  to  discuss  two  complex  events  whose  affordance 
structures,  although  more  elaborate,  still  seem  amenable  to  generative  systems 
theory. 

The  shape  of  nonrigid  objects.  Theories  of  object  perception  usually  attempt 
to  explain  the  perception  of  objects  and  patterns  which  do  not  change  their 
shape  over  time.  However,  a truly  adequate  theory  must  also  explain  the  origin 
oi  concepts  of  events  where  object  configurations  or  the  shape  of  objects 
un  ergo  dynamic  change.  Shaw  and  Pittenger  (in  press ) have  conducted  a series 
of  experiments  designed  to  explore  this  problem  The  assumption  behind  the 
research  is  this: 

Shape  is  ;onsidered  tc  be  an  event-dependent  concept  rather  than  an  absolute 
property  of  static  objects.  This  is  contrary  to  the  traoi'.onal  view  that  identifies 
shajx  with  the  metric-Euclidean  property  of  geometric  rigidity  under  transfor- 
mation, that  is,  the  fact  that  under  certain  transformations  (e.g.,  displacements) 

Uie  distances  between  points  on  an  object  do  not  change.  Unfortunately,  this 
definition  is  too  narrow  since  it  fails  to  apply  to  a manifold  of  natural  objects 
which  remain  identifiable  in  spite  of  being  remodeled  to  some  extent  by  various 

nonrigid  transformations  (e.g.,  growth,  erosion,  plastic  deformation  under 
pressure). 

Biomorphic  forms,  such  as  faces,  plants,  bodies  of  animals,  cells,  leaves,  noses, 
inevitably  undergo  structural  remodeling  as  they  grow,  although  their  transforms 
retain  sufficient  structural  similarity  to  be  idenrified.  Such  forms,  like  geological 
structures  under  plastic  deformation  or  archaeological  artifacts  under  erosion, 
are  relativr.y  nonrigid  under  their  respective  remodeling  transiormations.  Since 
the  property  of  geometric  rigidity  is  not  preserved  by  any  of  these,  it  cannot 
provide  the  invariant  information  for  their  identification.  Clearly,  then,  a new 
and  more  abstract  definition  of  shape  must  be  found  upon  which  to  develop  a 
theory  of  object  perception  that  is  broad  enough  in  scope  to  encompass  all  V 
objects  rigid  as  well  as  nonrigid  ones.  Consequently,  the  following  definition 
was  decided  upon:  Shape,  as  an  event-perception  concept,  is  to  be  formally 
construed  to  mean  the  sum  total  of  invariant  structural  properties  by  which  an 
object  might  be  identified  under  a specified  set  of  transformations. 

This  definition  should  sound  familiar  since  it  is  but  a restricted  version  of  the 
definition  of  the  affordance  structure  of  objects  given  earlier.  But  notice,  that  by 
this  definition  the  geometric  rigidity  of  an  object  under  displacement  is  but  oi  e 
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of  the  many  kinds  of  structural  invariants  possible.  By  a careful  study  of  the 
perceptual  information  used  to  identify  human  faces  at  different  stages  of 
growth  (i.e.,  age  levels),  it  was  hoped  that  the  generality  and  fruitfulness  of  the 
event-perception  hypothesis  might  be  further  tested. 

Perceiving  the  shape  of  faces  as  a growth  event.  Faces,  no  less  than  squares  or 
other  shapes  are  dynamic  events  since  their  affordance  structure  (e.g.,  shape)  is 
derived  from  a growth  process  (t  re  transformational  invariant)  which  presewes 
sufficient  structure  (structural  invariant)  to  specify  the  identity  of  the  face  of 
the  person  undergoing  the  aging  transformation  (growth).  In  a similar  fashion, 
different  people  at  the  same  stage  of  growth,  can  be  perceived  as  being  at  the 
same  age  level  because  growth  produces  similar  effects  over  different  structures 
(Pittenger  & Shaw,  1975).  These  common  effects  constitute  the  information 
specifying  the  transformational  invariant  of  the  growth  process.  Thus,  each 
transformation  can  be  identified  by  the  style  of  change  wrought  over  various 
objects  to  which  it  is  applied. 

In  addition  to  empirically  discovering  the  invariant  information  specifying  the 
identity  of  the  structures  ove  which  an  event  is  defined,  a problem  of  equal 
weight  for  the  event  percepnon  hypothesis  is  to  isolate  the  invariant  information 
specifying  the  transformation  by  which  the  dynamic  aspect  of  an  event  is 
defined.  Both  of  these  informational  invariants  must  be  found  in  every  event 
perception  experiment  if  the  affordance  structure  of  the  event  being  studied  is 
to  be  experimentally  defined.  Pittenger  and  Shaw  conducted  the  following 
experiments  in  an  attempt  to  discover  the  affordance  structure  of  the  growth 
event  defined  over  human  faces.  The  biological  literature  suggests  two  classes  of 
transformations  for  the  specification  of  the  transformation  of  skull  growth, 
strain  and  shear.  A strain  is  a geometric  transformation  which,  when  applied  to  a 
two-dimensional  coordinate  space,  changes  the  length  of  the  units  along  one  axis 
as  a transformation  of  .he  units  along  the  other  axis.  For  instance,  a strain 
transformation  can  take  a square  into  a rectangle  or  vice  versa.  On  the  other 
hand,  a shear  is  a geometric  transformation  which  transforms  the  angle  of 
intersection  of  the  coordinate  axis,  say  from  a right  angle  to  something  less  or 
more  than  a right  angle.  Such  a transformation  might  take  a square  into  a 
rhombus.  Consequently,  Pittenger  and  Shaw  constructed  a set  of  stimuli  by 
having  a computer  apply  different  degrees  of  these  two  transformations  to  a 
human  facial  profile,  and  then  by  photographing  the  computer  plotted  trans- 
forms of  the  given  profile.  Three  experiments  were  run  to  test  the  hypothesis 
that  the  perception  of  age  level  is  derived  from  information  made  available  by 
growth  events. 

To  illustrate  the  application  of  these  transformations  to  faces,  we  will  describe 
the  production  of  stimuli  for  the  first  experiment.  The  stimuli  were  produced  by 
applying  combinations  of  these  transformations  globally  to  a two-dimensional 
Cartesian  space  in  which  the  profile  of  a 10-year-old  boy  had  been  placed  so  that 
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chosen  to  test  the  supernormal  stimuli  hypothesis.  Supernormal  stunuli  are 
produced  by  exaggerating  some  relevant  aspects  of  a stimulus.  Ethologists  claim 
that  such  stimuli  lead  to  exaggerated  responses  (Tinbergen,  ) 951). 

Experiment  1 

To  test  the  effects  of  the  shape  changes  induced  by  shear  and  strain  the  profiles 
shown  in  Fig.  1 were  presented  by  slide  projector  to  the  subjects  in  a task 
requiring  magnitude  estimates  of  age.  The  subjects  were  instructed  to  rate  the 
ages  of  the  profiles  by  choosing  an  arbitrary  number  to  represent  the  age  of  the 
first  profile  and  assigning  multiples  of  this  number  to  represent  the  age  of 
succeeding  profiles  relative  to  the  age  of  the  first.  Twenty  subjects  were  asked  to 
rate  the  35  slides  resulting  from  the  transformations  described  above.  The  results 
were  straightforward.  Using  a Monte  Carlo  technique  Pittenger  and  Shaw  found 
that  91%  of  the  judgments  made  by  the  subjects  agreed  with  the  hypotheses  that 
the  strain  transformation  produced  monotonic  perceived  age  changes  in  the 
standard  profile.  On  the  other  hand,  using  the  shear  transformation  to  predict 
judgments  produced  only  66%  agreement.  Since  strain  was  by  far  the  strongest 
variable  of  age  change,  we  decided  to  test  the  sensitivity  of  subjects  to  very  small 
changes  in  profiles  due  to  this  transformation. 


Experiment  2 


Sensitivity  to  the  shape  changes  produced  by  the  strain  transformation  was 
assessed  in  the  second  experiment  by  presenting  pairs  of  profiles  produced  by 
different  levels  of  the  transformation  and  requiring  subjects  to  choose  the  older 
profile  in  each  pair.  A series  of  profiles  was  produced  by  applying  strain 
transformations  ranging  from  k * -0.25  to  +0.55  to  a single  profile,  where  & is 
the  coefficient  of  strain  used  in  the  equation  controlling  the  computer  plots. 
Eighteen  pairs  of  profiles  were  chosen;  three  for  each  of  six  levels  of  difference 
in  degree  of  strain.  The  pairs  were  presented  twice  to  four  groups  of  ten  subjects. 
Different  random  orders  were  used  for  each  presentation  and  each  group. 
Subjects  were  informed  that  the  study  concerned  the  ability  to  make  fine 
discriminations  of  age  and  that  for  each  pair  they  were  to  choose  the  profile 
which  appeared  to  be  older.  During  the  experiment  they  were  not  informed 
whether  or  not  their  responses  were  correct.  By  correct  response  we  mean  the 
choice  of  the  profile  with  the  larger  degree  of  strain  as  the  older. 

Several  results  were  found.  An  analysis  of  variance  on  percentage  of  errors  as  a 
function  of  difference  in  strain  showed  a typical  psychophysical  result-a  decline 
in  accuracy  with  smaller  physical  differences  and  an  increase  in  sensitivity  with 
experience  in  the  task.  However,  two  other  aspects  of  the  results  are  more 
important  for  the  question  at  hand.  First,  subjects  do  not  merely  discriminate 
the  pairs  consistently  but  choose  the  profile  with  the  larger  strain  as  the  older 
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profile  with  greater  than  chance  frequency;  in  the  first  presentation  the  larger 
strain  was  selected  on  83.2%  of  the  trials  and  the  second,  on  89.2%  of  the  trials. 
In  each  presentation,  each  of  the  40  subjects  selected  the  profile  with  the  larger 
k as  older  on  more  than  50%  of  the  trials.  In  other  words,  the  predicted  effect 
was  obtained  in  every  subject.  A sign  test  showed  the  change  probability  of  this 
last  result  to  be  far  less  than  .001 . Thus,  the  conclusion  of  the  first  experiment  is 
confirmed  in  a different  experimental  task.  Second,  sensitivity  to  the  variable 
proved  to  be  surprisingly  fine. 

Experiment  3 

A third  experiment  was  designed  to  determine  if  a structural  invariant  existed  by 
which  individual  identity  might  be  perceived  as  follows.  We  have  all  had  the 
experience  of  recognizing  someone  we  know  as  a child  years  later  when  they 
have  grown  to  maturity.  As  a preliminary  test  of  preservation  of  identity  under 
the  strain  transformation,  profile  views  of  the  external  portions  of  the  brain 
cases  of  six  different  skulls  were  traced  from  x-ray  photographs  and  subjected  to 
five  levels  of  strain.  Five  pairs  of  transformed  profiles  were  selected  from  each 
individual  sequence;  the  degree  of  strain  for  members  of  three  pairs  differed  by 
0.30  and  those  of  the  other  two  pairs  by  .45  values  of*.  A profile  of  a different 
skull  was  assigned  to  each  of  the  above  pairs  which  had  the  same  degree  of  strain 
as  one  of  the  members  of  tie  pair.  Slides  were  constructed  of  the  profile  triples 
such  that  the  two  profiles  from  distinct  skulis  which  had  the  same  level  of  strain 
appeared  in  random  positions  at  the  bottom.  Thirty  subjects  were  presented  the 
slides  and  asked  to  select  which  of  the  two  profiles  at  the  bottom  of  the  slide 
that  appeared  most  similar  to  the  profile  at  the  top.  The  overall  percentage  of 
errors  was  low:  for  the  30  sets  of  stimuli  presented  to  30  subjects,  the  mean 
error  was  less  than  17%,  with  no  subject  making  more  than  33%  errors.  Since  no 
subject  made  50%  or  more  ; r o s,  a sign  test  on  the  hypothesis  of  chance 
responding  (binomial  distribution;  by  each  subject  yields  a probability  of  far  less 
than  0.001 . indeed,  in  another  set  of  studies,  Pittenger  and  Shaw  also  found  that 
people  are  quite  able  to  rank  order  by  age  photographs  of  people  taken  over 
nearly  a decade  of  growth  from  pre-  to  post-puberty  years. 

The  results  of  these  three  studies  provide  support  for  two  important  hypoth- 
eses. the  strain  transformation  due  presumably  to  growth,  not  only  provides  the 
major  source  of  the  relevant  perceptual  information  for  age  level,  but  also  leaves 
invariant  sufficient  perceptual  information  for  the  specification  of  the  individual 
identity  of  the  person  by  the  shape  of  the  head  alone. 

These  experiments  also  support  the  contention  that  the  perceived  shape  of  an 
object  is  not  simply  the  shape  of  a static,  rigid  object,  but  is  rather  a higher  order 
structural  invariant  which  remains  relatively  unchanged  by  the  events  (i.e., 
transformational  invariants)  into  which  such  objects  may  enter.  Further  dramat- 
ic support  for  this  claim  is  provided  by  the  fact  that  the  identity  of  human  faces 
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is  preserved  under  elastic  transformations  as  distinctive  from  growth  as  artistic 
characterization.  The  success  of  political  caricaturists  rests  on  their  ability  to 
satirize  a political  figure  by  exaggerating  distinctive  body  or  facial  features 
without  obscuring  the  identity  of  the  famous  or  infamous  personage  depicted. 
Indeed,  there  is  evidence  that  such  an  artistic  redition  of  complex  structures 
facilitates  their  ^identification  (Ryan  and  Schwartz,  1956).  But  will  the  event 
perception  hypothesis  apply  equally  well  to  still  more  elaborate  events  in  which 
complex  transformations  are  defined  over  a variety  of  structures? 

Perception  of  a tea-making  event.  Recently,  at  the  Center  for  Research  n 
Human  Learning,  Jerry  Wald  and  James  Jenkins  have  been  investigating  the 
generative  nature  of  an  elaborate  event:  the  act  of  preparing  tea.  To  study  this 
event  24  photographs  were  taken  depicting  the  various  steps  invoked  in  the 
preparation  of  tea.  These  stimuli  were  presented  to  subjects  following  the  same 
experimental  design  used  in  the  “orbiting”  event  experiment  discussed  earlier. 
Sixteen  of  these  24  pictures  were  used  as  an  acquisition  set  portraying  the 
tea-making  event  to  subjects.  Later,  these  1 6 pictures,  plus  the  remaining  8 from 
the  original  set,  were  shown  to  the  subjects,  who  were  asked  to  indicate  whether 
the  picture  was  new  or  one  which  occurred  during  acquisition.  The  subjects  were 
unable  to  distinguish  the  new  but  appropriate  pictures  of  the  event  from  the 
pictures  they  had  actually  experienced  during  acquisition.  Once  again  we  see 
that  a partial  subset  of  the  possible  instances  of  an  event  can  specify  the  entire 
event. 

The  general  results  found  in  this  experiment  were  essentially  the  same  as  those 
found  in  the  case  of  the  “orbiting”-event  experiment  reported  earlier.  Namely,  it 
was  again  found  that  subjects  were  very  good  at  recognizing  as  new  pictures 
which  were  physically  similar  to  those  in  the  acquisition  set  but  inappropriate  as 
elements  in  the  event.  For  example,  if  a type  of  movement  or  direction  of 
movement  Inappropriate  to  the  event  portrayed  during  acquisition  was  depicted, 
subjects  classified  the  picture  as  new.  Clearly,  the  knowledge  which  subjects 
gained  during  acquisition  was  knowledge  of  an  abstract  system  of  relations,  that 
is,  an  event,  not  knowledge  of  exact  copies  of  the  exemplars  specifying  the 
event.  Additional  support  for  the  contention  that  subjects  are  acquiring  a 
generative  system  of  relations  is  provided  by  the  finding  that  subjects  vho  were 
provided  more  experience  with  the  acquisition  pictures  were  ev;n  more  likely  to 
mistake  the  novel  but  appropriate  instances  for  those  actually  s en. 


GENERAL  CONCLUSIONS,  AND 
IMPLICATIONS  FOR  INSTRUCTION 

Each  of  the  event  perception  experiments  discussed  is  rot  only  amenable  to  a 
generative  systems  explanation  but  seems  to  require  t.  The  range  of  events 
surveyed,  from  simple  events  such  as  orbiting  objects,  to  more  elaborate  events 
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involving  growth  of  human  faces  and  the  preparation  of  tea,  suggests  that  the 
ability  to  formulate  abstract  concepts  is  a basic  cognitive  capacity  underlying 
nowledge  acquisition.  This  characterization  of  knowledge  acquisition  has  sever- 
al important  implications  for  a theory  of  instruction.  non  nas  sever 

The  most  general  implication  concerns  how  we  should  conceptualize  the 
nature  of  those  situations  in  which  we  accrue  useful  knowledge  abou“  ! 

exoeriences  UFfh  80C‘3  ’ “d  pr°fessi0nal  environments.  If  the  majority  of  our 
eS  or  fresh  f 31638  7°^  enC0Unters  with  either  novel  instances  of  old 
blvond  c I f T ? "eW  eVe"tS’  thCn  the  g0als  on  instruction  must  go 
sh  TA  , f°r  h°W  particulars  may  be  learned.  Rather,  the  primary  goal 
should  be  to  train  people  to  exploit  more  efficiently  their  cognitive  capacity  to 
assimilate  knowledge  that  is  abstract  and,  therefore,  generative  ? * 

,,°re0’er’lfadaptive  responding  to  even  ordinary  events,  such  as  recognizing 
faces  °r  making  tea,  entails  generative  knowledge  of  abstract  relationships  then 

af  cl  nt  phll  A f0ridH63ling  Wlth  high6r  f°rmS  °f  knowIedge  " -ch  fields 

edee  !n All  V'  ^ hiSt0ly’  01  law'  Acquisition  of  knowl- 

g all  areas  is  a result  of  abstraction  over  well  chosen  instances  of  events 

Accent  "A'8  mStantiate  the  generator  sets  for  the  concepts  involved’ 

IZ  Tf 6 l rf  "S'  What  POtential  impact  n’igb1  such  a cognitive 
theory  of  knowledge  have  upon  current  educational  practices?  ^ 

u 7 Str?C‘i0n  should  Primarily  consist  of  lessons  in  which  stu- 

tte  saw  ifS  c TfT*  ^ thoSC  Core  ConcePts  of  the  field, 

we  saw  in  the  case  of  the  experiments  reported,  the  abstraction  of  generative 

ys  ems  requires  first-hand  experiences  with  those  exemplary  instances  of  a 

tySfndCOsnSl,tf  ltS  g6rIerat0r  Set'  H°wever>  0U1  educati°nal  institutions 
ypicJly,  and  sometimes  exclusively,  use  the  “lecture”  technique  by  which 

' P“!i"  r“‘Pien,S  °f  ,h'  “ tap,So„s  fawn 

from  another  persons  experience,  where  in  actuality  some  version  of  the 

VaTr^Tfo™  of7'd  ^ aFmUCh  b6tt6r  f0mi  °f  instruction’  Instruction,  which 
takes  the  form  of  leammg  facts  or  principles  about  some  concept  * is  not  a 

ef  fah  f(  th°Ugh  “ may  ^ 3 USCful  suPPlement>  I°r  acquiring  direct  expert- 

typ  ca  fo  mC°Af  *’  T ^ Pr6Sented  ^ S°me  3nal0giCal  °r  Simpler  pr^t0- 

ypeal  torm.  This  is  why  some  courses  of  study  wisely  relv  heavilv  ,mnn 

fAsThLd  ^th  Tri6nCe-  A66'1’  CVery  Cl3SSr0°m  Sh0uld  66  a “'Oratory” 

tor  first-hand,  rather  than  second-hand  experiences.  * 

There  is  little  new  in  the  above  obseivations  regarding  the  preference  of  active 

leamTZfis  7^  ^ ^ reCepti°n  of  material  to  be 

LZ7 J ’ h0wever-  15  the  insigbt  that  the  generative  capacity  to 

ormulate  abstract  concepts  may  be  naturally  engaged  when  the  student  expert- 

selection^ rex  SAet  °ff eXemplarS>  the  generator.  Therefore,  the 

selection  of  the  exemplars  of  a concept  to  be  taught  is  a very  different  affair 
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requiring  the  joint  efforts  of  cognitive  psychologists  and  instructional  experts. 

. Another,  related  implication  of  the  generative  characterization  of  con- 
ceptual knowledge  is  to  offer  a new  theory  of  the  transfer  of  training  effects  so 
desirable  throughout  the  educational  progress  of  a student.  How  do  old  concepts 
facilitate  the  learning  of  new  ores;  and  how  does  new  information  become 
integrated  with  existing  information? 

Since  'he  generative  nature  of  knowledge  has  not  been  seen  as  implying  a core 
cognitive  capacity,  both  specific  and  general  transfer  have  been  seen  as  a second- 
ary, spin-off  effect  of  learning  specific  reactions  to  specific  objects  or  events.  We 
ave  attempted  to  show  throughout  this  chapter  why  this  characterization  of 
learning  is  backwards.  The  generative  cognitive  capacity  that  is  responsible  for 
transfer  is  not  derivative  from  or  based  upon  knowledge  of  exact  replicas  or 
copies  of  experiences.  It  is  the  abstractness  of  concepts  that  accounts  for  the 
generality  or  transfer  of  conceptual  knowledge  gained  in  one  situation  to  new 
situations.  Transfer,  therefore,  is  inherent  in  the  acquisition  of  abstract  concepts. 

3.  A final  implication  of  this  theory  for  instruction  related  to  the  selection  of 
criteria  for  evaluating  performance  as  an  indicant  of  the  state  of  conceptual 
knowledge  attained  by  the  student.  This  has  proven  to  be  a most  difficult  and, 
somewhat  surprisingly,  a most  controversial  issue.  The  proposed  theory  surest s 
why  this  is  so. 

A major  source  of  difficulty  in  the  evaluation  of  the  knowledge  a student  has 
acquired  is  to  know  what  types  of  performance  count.  There  are  several 
performance  levels  students  may  attain  due  to  either  their  sophistication  in  an 
area,  their  motivation,  or  the  nature  of  the  concepts  to  be  learned.  First,  and 
easiest  to  evaluate,  is  the  ability  to  verbalize,  or  articulate  in  some  other  overtly 
demonstrable  form,  exactly  what  they  know  about  a topic.  Unfortunately,  this 
level  of  performance  is  exhibited  inadequately  by  most  people  and  tends  to  be 
rare  except  for  simplistic  cases  where  a rote  memorization  of  particulars  is 
appropriate.  Such  knowledge,  however,  is  not  necessarily  generative  in  nature 
and,  thus,  its  successful  evaluation  poses  no  guidelines  for  an  adequate  evalua- 
tion of  students’  abilities  to  use  abstract  conceptual  knowledge. 

A second  and  more  frequently  exhibited  performance  level  is  that  the  student 
has  attained  useful  knowledge  of  a topic  but  is  unable  to  articulate  what  he  or 
she  knows.  Clearly,  a very  important  goal  of  education  is  to  bring  a novice  in 
some  subject  matter  area  up  to  the  level  of  an  expert.  Indeed,  often  we  would  be 
very  happy  if  our  pedagogical  attempts  had  even  more  limited  success  in  that  the 
student  somehow  learned  to  make  sound  judgments  although  remaining  unable 
to  articulate  the  basis  for  the  judgments. 

This  state  of  affairs,  rather  than  being  rare  among  experts,  is  actually  very 
common.  Few  experts  can  specify,  in  algorithmic  clarity,  the  reasoning  process 
mey  go  through  in  order  to  arrive  at  a sound  judgment  with  respect  to  a problem 
in  their  area  of  expertise,  although  they  may  present  a learned  rationalization 
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afterwards.  Man)  art  connoisseirs  are  able  to  distinguish  styles  of  artists  cate- 

SutT  8k!°  Cra’  CldtUrC  °r  S°h001,  Vari°Ui  artifacts  Presented  to’ them 
ithout  being  able  to  say  beforehand  the  criteria  they  use.  Similarly  chess 

to  s«c-f  °Utc0mes  of  “d  strategies  of  opponents  without  being  able 
to  specify  a priori  criteria  for  doing  so.  Indeed,  in  all  areas  of  knowledge  to  be 
an  expert  is  synonymous  with  the  ability  to  render  objectively  sound  judgments 
without  necessarily  being  able  to  specify  every  step  in  the  raStif 

exwrts  would  onl^  S0  d‘fflCUlt  t0  do  so’  educating  novices  to  the  level  of 
experts  would  only  require  rote  memorization  of  algorithmic  judgmental  mo- 

cedures  applied  to  rotely  memorized  banks  of  data.  P 

edge  Va^^buVo^fth1?  ^ ***!?*  d°’  °f  C°UrSe’  promu,8ate  from  a knowl- 
edge oasis,  but  one  that  is  usually  more  tacit  than  explicit.  In  short  exnert 

SlaST!  liV  bfy;Pr0dUCt  0f  generative  V^em*  of  knowledge  rather ’thaTof 
t data  banks  of  factual  information.  Consequently,  in  our  effort  to  evaluate 

“ SCr  l°  311  eXpert’Sjudgmental  aMity  education  has  moved  novices 
ohtai^H  ^ assess  primarily  the  degree  of  generative,  tacit  knowledge  they  have’ 
obtained  and  not  just  their  explicit  knowledge  of  itemized  facts  Based  on  the 
current  theory  and  findings,  we  suggest  that  the  following  questions  should  be 

answered  by  any  knowledge  evaluation  procedure: 

a.  Car  Ihe  student  identify  the  same  set  of  dear-cases  of  the  concent  that  a 

nT^si;rsc2r upon!  This  - *•  ** « s***. 

instance  of  the  concept", om'^Jitta  tog'^slhket 

an  t^htnce  previously  studied,  even  though  i,  was,  in  fact,  never  " 

c Can  the  student  deal  with  novel  instances  of  :he  concept  with  the  same 
facility  shown  with  familiar  ones?  And  corollary  to  both  (b)  and  (cV 

J'  T ,thC  rePeated*y  stored  student  display  an  inability  to  recall  whether 
he  or  she  has  seen  relevant  particulars  about  the  concept  area  before  while  at 

Le'nTf^hrfs  C°nSlderable  COr  fidence  ^ irrelevant  particulars  were  not 

Seed  Zt  r TF11*"  CritCri0n  f°r  determining  if  the  studen,  has 

articltton  In £ **  lhat  are  n0t  CCCessib,e  to  conscious 

T fact>  the  Proportion  of  false  positives  in  recognition  tests  mav 

leT  C hC  °fr  W3y  !°  detemiine  Whether  311  inarticulate  studTnt  has  neverthe 
1 t SUf  fiC'ent  "°WledgC  f°r  making  sound  intuitive  judgments  (on  The 
sumption,  of  course,  that  a comparison  of  the  students’ judgments  with  that  of 
experts  is  not  directly  feasible).  1 ^ ™ 0131  ot 

fin°a^r!y,  therVS  Stil!  mUCh  WOrk  l°  be  accomplished  before  drawing  any 

ontoinn  T"*  3 °Ut  the  Pr°P0Sed  the°ry- 11  already  exhibits>  howcver,  in  our 
opinion  sufficient  promise  in  both  theoretical  and  practical  areas  to  merit 

further  development  by  both  cognitive  and  educational  psychologists. 
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Toward  a Theory 
of  Instructional  Growth 

S.  Farnham-Diggory 

University  of  Texas  at  Dallas 


From  1967  to  1971,  when  I was  writing  Cognitive  Processes  in  Education 
(Famham-Diggory,  19/7),  I was  largely  alone  in  attempting  to  relate  informa- 
tion-processing psychology  to  education.  That  has  now  changed,  and  the 
chapters  in  this  volume  represent  important  new  contributions  to  the  area.  The 
chapters  also  reveal  awareness  by  this  group  of  distinguished  scientists  of  what  a 
big  job  they  have  undertaken.  Really  understanding  educational  problems,  really 
analyzing  the  psychological  processes  involved,  really  distilling  and  applying 
psychological  principles,  are  enormously  complex  tasks-on1,  recently  under- 
taken by  theorists  capable  of  coping  with  them.  Prior  to  the  last  decade,  the  best 
theoretical  minds  in  the  business  were  working  on  problems  derived  from 
relatively  simple  learning  theories  of  the  day.  Such  theories  were  not  adequate 
for  the  proper  study  of  instructional  behavior,  although  they  were  challenging  in 
other  ways.  Finally,  there  emerged  a theoretical  framework— the  information- 
processing framework— complex  enough  to  fit  the  real  world  of  education.  As  a 
result  of  this  development,  and  of  the  caliber  of  scientific  though  it  is  fostering, 
the  next  10  or  15  years  of  psychological  research  should  produce  major  new 
insights  into  instructional  issues. 

For  this  to  happen,  however,  our  theories  must  push  well  beyond  the  descrip- 
tive level— which  is  to  say,  they  must  generate  testable  predictions  that  a smart 
educator,  flying  by  the  seat  of  his  pants,  would  not  be  likely  to  make.  As  Carroll 
(Chapter  1,  this  volume)  has  noted,  most  educational  theory  today  has  the  form 
of  cultural  tradition  passed  on  by  word  of  mouth.  Educators  trust  this  tradition 
because  it  provides  them  with  hand-holds.  But  we  know  it  will  not  get  them 
where  a good  theory  will  get  them.  The  relation  between  intuitive  practice  and 
good  theory  is  analogous  to  crossing  a stream  by  following  a personal  map  from 
rock  to  rock,  and  crossing  by  means  of  a properly  engineered  bridge.  Unfortu- 
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nately,  much  of  psychology,  purporting  to  build  the  bridge,  may  be  merely  using 
a different  language  to  describe  the  process  of  getting  from  rock  to  rock.  We  can 
see  this  by  attending  to  the  descriptions  of  good  educators  who  are  not  familiar 
with  psychological  jargon. 

An  especially  useful  and  humbling  book  in  this  regard  is  one  written  by  a man 
named  Charles  Allen  (1919),  described  on  the  flyleaf  of  his  book  as  “Sometime 
Agent  for  Industrial  Training  of  Boys  and  Men,  Massachusetts  Board  of  Educa- 
tion, and  Superintendent  of  Instructor  Training,  U.  S.  S.  B.  fUnited  States 
Shipping  Board]  Emergency  Fleet  Corporation.”  Mr.  Alien  was  bom  in  1862, 
and  this  book,  which  was  obviously  a culmination  of  30  or  40  years  of 
experience,  was  published  in  1919.  Although  some  of  thePavlovian.Watsonian, 
and  Gestalt  psychological  principles  were  around  at  that  time,  Allen  was  inno- 
cent of  them.  He  told  it  like  it  was,  without  benefit  of  any  such  newfangled 
nonsense.  The  title  of  his  book  sets  the  straightforward  tone.  It  was  called  ‘The 
Instructor,  the  Man,  and  the  Job.”  Let  me  give  you  some  examples  of  Allen’s 
instructional  wisdom: 


Whatever  the  instructor  intends  to  put  over  to  the  learner  must  be  given  to  him  in 
some  order;  it  cannot  be  given  to  him  all  at  once.  This  brings  up  the  question  as 
to  . . . whether  there  is  not  a best  instructional  order  . . . and,  if  so,  how  it  can  be 
determined. 

An  effective  order  of  instruction  . . . presents  certain  characteristics  among  the  more 
important  of  which  are: 

1.  The  different  teaching  jobs  ...  are  so  arranged  that . . . each  succeeding  job  extends 
the  learner’s  knowledge  and  skill,  but  does  not  call  for  a different  sort  of  knowledge  and 
skill. 

2.  Jobs  that  require  the  learner  to  think  of  the  least  number  of  different  things  at 
once  come  first  and  jobs  that  require  the  learner  to  think  of  the  most  different  things 
come  last. 

3.  The  jobs  are  arranged  according  to  the  difficulty  of  learning  how  to  do  them  rather 
than  according  to  the  order  in  which  they  would  be  done  in  getting  out  a finished 
product . . . That  is,  the  order  is  an  instruction  order,  not  a production  order. 

...  An  illustration  of  a course  of  instruction  which  does  not  meet  these  conditions 
would  be  the  following  procedure  . . . Where  two  rivet  holes  do  not  come  together  ‘fair’, 
they  must  be  reamed  to  a common  size.  This  is  usually  done  with  a pneumatic  or 
electric  drive  ...  A group  of  men  after  having  been  cautioned  as  to  the  control  of  the 
machine  were  placed  in  compartments  of  the  ship  to  ream  out  all  holes  that  required 
reaming.  These  holes  were,  of  course,  all  sizes,  fair  and  unfair  . . . Some  could  be  reamed 
in  easy  positions  and  some  required  difficult  positions.  Under  these  conditions  the 
learner  was  immediately  put  up  against  jobs  of  all  sorts  of  difficulty,  since  he  took  the 
holes  as  they  came.  He  worked  in  this  way  until  he  had  learned  to  ream.  Under  these 
conditions  the  learner  had  to  learn  too  many  things  at  once,  reducing  the  probability 
that  he  would  thoroughly  grasp  any  one  thing,  causing  a state  of  mental  confusion  and 
slowing  up  the  learning  operation  . . . Under  a properly  arranged  order  of  instruction  the 
work  would  have  been  so  laid  out  that  holes  of  different  degrees  of  fairness  would  have 
been  marked  so  that  the  learner  first  learned  to  ream  the  fairest  hole,  then  the  next 
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s:  — « - - 

arrangement  i.  an  example  of  bad  ins  true  tiona!  condttonr 

A*  between  the  twe .method,  of  £ ^^“^“^ofmind 

much  faster,  make  a better  workman  of  him  k«p  IUm  m » m hi, 

while  he  was  under  training  and  turn  him  out  with  a much  better  itutuae 
job  (Allen,  1919.  pp.  78-80,  italics  his]. 

We  can  list  mary  psychological  principles  contained  in  Allen’s 
From  1918  four  decades  of  discrimination  learning  theory  would  have  been 
applicable.  But  would  it  have  done  any  more  for  Allen  than  he  could  have  done 
fofhimself-even  experimentally?  Without  benefit  of  stimulus-response  jargon 
Allen  could  have  and  probably  did,  run  a crew  through  the  ordered  proced 

another  crew  through  the  unordered  procedure,  and 
measure  resulting  work  skills  and  attitudes-sufficiently  well  to  make  money 
saving  decisions  about  training  procedures.  T.aditional  d.scr.m.nat.on  lea  g 
theory  would  not  have  expedited  such  an  empirical  program  very  much.  It 
would  not  have  built  a bridge,  but  would  merely  have  renamed  some  o 
empirical  procedures.  Can  modern  information-processing  psychology  do  any 
beuer'’  Can  it  provide  new  information,  new  guidance  wluch  a smart  mstrucUr 
woiw  not  be  able  to  get  for  himself?  Or  are  we  also  jus,  renammg  some 

Cr£e  you  example.  Allen  outlined  in  detail  the  necessity -steps 

for  performing  wha,  we  would  call  a task  analysis.  For  any  g^  bbek  of^k 
ffor  example  those  of  house  carpentry),  the  instructor  was  to  de^de  what  the 
% factor,  were  (Allen’s  term).  What  had  to  be  increased?  What  had  to 

For  example,  fear  of  working  at  a height  had  to  be  decreased  in 
house  carpenters.  Accuracy  of  measurement  and  placement  had  to  be  increased. 
The  nuX  of  operations  to  be  remembered  and  performed  had  to  bemc«£d 
Havine  designated  these  progression  factors,  the  instructor  was  then  to  allocate 
jobs  which  matched  the  levels  of  progression.  For  example,  the  vc^‘*tJ° 

*Z  ££-  was  to  wotk  this  CM  b,  ptmins  .11  1*  - ™' 

fitting  them  into  a diagram  like  that  shown  in  Fig.  1.  The  Os  reter  to 
the  in  cos,  pass  belo, . be  itto.es  on  to  the  n».e 

advanced  levels. 
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house  carpentry 
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FIG.  1 Task  analysis  chart  used  by  Allen  in  designating  steps  in  the  training  of  house 
carpenters  (Alien.  1919.  p.  90). 


The  question  again  is:  do  our  modem  methods  of  task  analysis  really  add 
anything  to  this?  Or  have  we  merely  restated  in  fancier  and  more  detailed  jargon 
what  good  instructors  already  know?  That  is,  are  we  helping  these  instructors  in 
any  way? 

Two  brief  final  examples,  more  cognitive  in  nature.  Allen  talks  about  the  need 
to  teach  special  technical  terms  and  skills  on  the  job,  rather  than  in  some 
preliminary  classwork— which  is  what  he  says  most  lazy,  inexpert  instructors 
would  rather  do.  Allen  calls  this  knowledge  auxiliary  material,  and  says: 

. If  | auxiliary  material)  is  given  him  in  advance  or  the  job  he  has  nothing  to  ’tie’  it 
to;  it  makes  but  little  impression.  If  it  is  given  him  after  the  job  he  has  not  been  given  it 
when  he  has  a chance  to  apply  it.  In  either  caae,  what  ‘tying’  he  can  do  will  be  to  such  a 
general  idea  of  the  situation  u he  may  get  from  such  general  knowledge  of  the  job  as  he 
may  have  picked  up  somehow  ...  a pretty  we*x  tiling  to  tie  to ...  In  order  that  he  thall 
think  of  the  thing  in  question  when  he  itiould,  he  must  have  gotten  it  in  connection 
with  some  operation,  so  thai,  whenever  L “rforms  thai  operation  the  thing,  sty 
‘shooting  the  tool,’  will  come  up  in  his  mind  n connection  with  that  part  of  the  job.  It 
is  the  failure  to  recognize  this  fact  that  makes  ;a  mur'.i  preliminary  work  in  the 
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fundamentals’  of  so  little  value;  general  talks  on  safe  y first  are  given  and  then,  when  the 
men  on  the  job  do  exactly  what  the  instructor  warned  them  not  to  do,  he  wonders  what 
is  the  matter  and  calls  them  stupid.  He  puts  up  lessons  on  fractions,  and  then  finds  that 
his  men  do  not  know  how  to  use  fractions  on  the  job;  he  teaches  trade  terms  in  advance 
of  their  use  and,  when  they  come  up  on  the  job,  the  men  do  not  know  their  terms.  In  all 
these  cases,  the  trouble  is  that  the  instructor,  who  has  plenty  of  trade  experience  to  tie 
to,  forgets  that  the  learner  has  little  or  none;  it  is  another  case  of  the  instructor  thinking 
of  the  problem  with  his  own  brain  and  not  putting  himself  in  the  place  of  the  learner 
| Allen,  1919,  p.  103 1 . 

Allen  calls  this  correct  method  of  instruction  the  tying  up  method,  and 
provides  specifications  for  exactly  how  the  instructor  is  to  carry  this  out,  how  he 
should  decide  what  the  auxiliary  material  is,  how  he  should  time  its  introduction 
on  the  job,  and  so  forth. 

A final  example  has  to  do  with  the  very  interesting  perceptual  problem  of  how 
you  know  a rivet  is  hot  enough.  Before  introducing  this  problem  Allen  dis- 
courses on  what  we  would  now  term  the  Socratic  method  of  questioning.  Allen 
might  not  have  known  that  term,  but  he  did  know  his  man  had  to  be  taught  to 
think,  and  that  the  only  way  to  do  that  was  to  get  his  mind  going. 

In  explanation  it  may  be  stand  that,  in  riveting,  the  rivets  are  driven  hot.  They  are 
heated  in  a small  portable  furnace  or  forge.  A part  of  the  job  of  the  ‘heater  boy’  is  to 
pick  out  rivets  at  the  right  heat  foi  driving.  If  too  hot  they  are  ‘burned’  and  should  not 
be  driven.  If  too  cold  they  will  not  drive  properly  ...  The  heater  boy  is  paid  for 
knowing  how  to  pick  out  a rivet  at  the  right  heat. 

...  In  planning  a lesson  the  instructor  has  three  questions  to  answer.  First,  whet  are 
the  ideas  to  be  put  over  in  the  teaching  unit?  Second,  what  is  the  teaching  base  or 
‘jumping-off  point’  (J.O.P.)  that  is  proposed  to  be  used?  Third,  what  ideas  already  in  the 
learner's  mind  does  the  instructor  intend  to  utilize  in  carrying  the  leamer  up  to  the 
J.O.P.?  ...  A few  general  rules  can  be  given. 

Work  from  the  qualitative  or  general  notion  to  the  quantitative  or  exact  notion, 
always  ask  how  or  ‘what’  before  asking  ‘how  much' . . . General  ideas  should  be 
presented  before  specific  ideas.  For  example,  the  idea  that  the  rivet  must  be  at  a certain 
heat  to  be  right  will  be  put  up  before  the  idea  that  the  right  color  is  just  under  a white 
heat. 

...  In  order  to  put  over  these  ideas,  the  learner  must  be  made  to  have  in  his  mind  a 
certain  group  of  ideas  or  a picture  to  serve  as  a foundation  for  the  building  on  of  the 
new  idea  contained  in  the  teaching  unit.  In  this  particular  lesson,  a good  J.O.P.  is  to  have 
the  learner  thinking  of  the  problem  of  knowing  when  a rivet  is  hot  enough. 

|To  build  up  the  J.O.P. , the  instructor  is  to  ask  such  questions  as  the  following:  j 
First  lira.  (A  rivet) 

1.  Have  you  ever  seen  a rivet? 

2.  Can  you  tell  a rivet  from  a bolt? 

Second  Idea.  (A  hot  rivet) 

1 . Could  you  pick  up  a rivet  that  yc  l found  lying  around  the  yard? 

2.  Could  you  tell  a very  hot  rivet  fre  s cold  rivet  without  touching  it? 

Third  Idea.  (A  rivet  heated  enough) 

I Can  a rivet  be  heated  to  different  hots? 

2.  Would  it  make  any  differenct  what  heat  a rivet  has,  provided  it  is  hot? 
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[Trajnmg  then  includes  such  experiences  as:  ] 

SUBmqpeS|?0nS  “ “ teoU^^,y  heated  rive,s’ mean,inie  asWng 

BUI,  Pick  out  a correctly  heated  rivet 

Sam;  Did  he  do  it? 

Jack;  How  do  you  know  he  did  it? 

Tom;  You  pick  out  a rivet. 

Jack;  You  watch  him. 

Sam;  Pick  out  another  one. 

Bill;  That  wasn’t  right,  was  it? 

Jack;  Pick  out  a burnt  rivet. 

(Allen,  1919. . p.  l^V-^j')  ."*1  Umi'  Sdtlsfied  Mch  boy  know-  a properly  heated  rivet 

though  Uour* knowledge  of  the^  psycho!3*6^  ^ that 

ihink  c u/  g 01  the  Psychological  principles  involved’  The  answer  1 

solutions  were  at  hand  in  th?  • b * eVen  l°  recognize  when  some  of  the 
uni  were  at  nand  m their  own  experience  The  fart  th,. 

Let  us  consider  some  of  the  principles  this  volume  has  offered. 


DYNAMIC  THEORETICAL  MODELS 

here  s,em  from  *-■*  «— *■»> 
IChaDler  1 n ,L  , ch!m®"S  conceptual  fields.  Saw  and  Wilson 

capacity'  ^ h„Z 

zzzzrzz  imTouel;  A',en 


- ihrttlTiylf^jtt'  i rvi • II  Jn&JLmLoijaii 
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The  Hyman  study  (Chapter  8,  this  volume)  begins,  in  effect,  where  the  Shaw 
and  Wilson  study  ended:  with  a fully  formed  concept,  that  of  a professional  role. 
Hyrnan  was  concerned  with  the  power  of  such  a stereotype  to  distort  recog- 
nition memory  for  trait  adjectives.  This,  again,  is  a dynamic  theoretical  con- 
ception. Hyman’s  data  show  that  concepts  actively  influence  recognition 
memory.  In  Allen’s  world,  that  could  mptn  false  recognition  of  a heated  rivet 
whenever  a trainee  held  a strong  preconception  of  appropriate  rivet  character- 
istics. True,  Allen  was  prepared  for  false  recognitions,  and  he  structured  the 
instructional  situation  accordingly.  But  Hyman’s  research  points  the  way  to- 
ward exact  scaling  of  characteristics  that  a student  must  learn  to  notice,  and 
exact  predictions  about  the  extent  to  which  prior  learning  will  affect  detection 
skills. 

The  LNR  network,  as  described  by  Norman,  Geritner,  and  Stevens  in  this  volume 
(Chapter  9)  and  elsewhere  (e.g.,  Norman,  1973;  Lindsay  & Norman,  1972)  is  a 
fully  dynamic  theory  of  how  memory  is  modified  by  the  new  demands  that  are 
made  upon  it.  In  the  LNR  network  (as  in  the  networks  of  Collins  & Quillian, 
1972;  Schank,  1972;  and  Anderson  & Bower,  1973)  concepts  can  be  represented 
as  object-relation-attribute  constellations.  These  constellations  are  not  fixed  but 
can  be  formed  and  re-formed  as  the  occasion  demands.  Norman,  Gentner,  and 
Stevens  provide  several  examples  of  the  mental  restructuring  that  takes  place 
during  tutorials.  The  interaction  between  the  experimenter  and  the  student  in 
the  Jump  Problem  (pp.  192-193,  this  volume)  would  probably  have  been 
especially  appealing  to  Allen.  One  imagines  that  he  would  have  admired  the 
systematic  protocol  analysis,  the  step-by-step  printout  of  changes  in  the  stu- 
dent’s programming  rule.  Such  detailed  analysis  is  the  theoretical  groundwork 
necessary  to  the  discovery  and  bracketing  of  critical  teaching  moments.  Even- 
tually, we  should  be  able  to  define  Allen’s  J.O.P.  (“jumping  off  point”)  as 
rigorously  as  we  can  define  the  critical  heat  of  a rivet. 

Greeno  (Chapter  7,  this  volume)  illustrates  an  application  of  LNR  theoretical 
principles  to  a lesson  in  psychophysics.  Again,  we  see  how  knowledge  can  ue 
represented  as  a set  of  dynamic  interrelationships.  Greeno  also  asks,  in  this 
section  of  his  chapter,  what  higher-order  goals  and  constraints  affect  student 
performances.  Essentially,  we  can  say  that  there  are  rules  affecting  entry  into, 
and  use  of,  the  conceptual  network  (Lindsay  & Norman,  1972).  Before  answer- 
ing a question  about  the  facts  of  psychophysics,  one  must  answer  the  question: 
What  kind  of  an  answei  does  the  instructor  want?  Nowhere  are  such  meta- 
strategies considered  by  Alien. 

Additional  facets  of  Greeno’s  chapter  will  be  discussed  in  a later  section. 

In  summary,  we  can  say  that  our  modem  technological  and  theoretical  capacity 
to  represent  changing  conceptual  states  is  a major  advance  in  instructional 
theory.  By  virtue  of  this  capacity,  we  can  work  toward  the  precise  specification 
of  (a)  key  instructional  concepts  that  will  have  maximal  viability  and  generality; 
(b)  the  rate  at  which  new  information  will  be  assimilated  to,  or  distorted  by,  old 
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information;  (c)  criticaJ  teaching  moments;  and 
which  organized  knowledge  is  accessed. 

Now  we  turn  from  consideration  of  dynamics 


(d)  rules  governing  the  way  in 
to  consideration  of  structures. 


THE  SCHEMATIC  FRAME 

Each  of  the  chapters  under  discussion  here  has  evidenced  concern  with  mental 

— T ieSCribed  “ SChemata-  11  is  c,ear,y  the  case  that 

theories  of  instruction  will  postulate  higher-order  schemes  governing  bofh  the 

learner’s  potential  and  the  teacher’s  strategies. 

JTT  VlTf  h2Ve  diSCUSSed  relational  «**"»**  underlying  judgments  of 
gmg.  As  infant  faces  were  geometrically  transformed,  certain  feature  relation 
ships  remained  invariant.  This  invariance  could  be  called  the  identity  schema ■ the 
transformational  invanants  themselves  comprised  the  aging  schema 

tiorftask  tZZT'I  3‘  intervening  stages  in  a recogni- 

e reotfne  TP  7 3 ,r3lt’  0r  3 ,rait  "ame’  eV°kes  a broad  Personality 

stereotype.  This  stereotype,  in  turn,  increases  the  probability  that  odier  traits 

associated  with  it  will  be  noticed,  and  even  falsely  recognized. 

and  S.evens  consider  schemata  to  be  the  basic  units  of  memory  For  learning  to 

have  occurred,  relevant  schemata-that  is,  schemata  similar  to  those  activated  by 

naiJ™fem”  TnrmUS,hh3Ve  ****  m°dified  °r  recombined-  Their  “Mayon- 

with  task  demands  M SUbjeCtS  SUmm°n  Schematic  analogs  to  cope 

with  task  demands.  Mayonnaise  must  be  made  of  something  that  looks  white  and 

ta  es  spicy,  hence  it  must  be  made  of  whipped  cream  afid  mustard  Greenes 

answe^0^!  Question  aCtUa,lly  SeleCting  the  r^<  form  for  an 

answer  to  a question  is  selecting  a schema;  deciding  how  specific  to  be  is 

deciding  about  the  potential  application  of  a schema;  learning  ^what  counts  as 

an  explanation  of  something  [p.  156]”  is  learning  that  “an  explanation  is 

naft  ofih  Th  '"8  '°  rUkS’  3nd  these  rules  constitute  a schema  that  must  be 
part  of  the  student  s knowledge  [p.  1 56] 

Formulations  of  this  type,  and  the  experiments  delineating  them  have  moved 
modern  instructional  science  well  beyond  Allen’s  “tying  up  method  ” However 
ecause  we  try  to  educate  masses  of  students  simultaneously,  there  remain 
massive  discrepancies  between  the  available  schemata  of  learners  and  informa 
ion  presentee!  by  teachers.  That  is  one  aspect  of  what  J.  McV.  H^nt  (1961)  has 
called  “the  problem  ot  the  match.”  U °U  nas 

There  are  many  different  ways  in  which  the  term  schema  is  now  being  used 
roughout  psychology.  Although  this  must  trouble  theoretical  purists  it  has  an 

ZT'hT  on:  “ «*»«* 

our  heads  contain  higher-order  units  and  capacities.  Almost  everyone  it 
seems,  has  left  his  little  S-R  links  by  the  wayside.  everyone,  ,t 

diftblrH6'6  K f"6  3SPeCt  °f  behaviorism  that  cognitians  have  not  left  behind'  the 
distinction  between  competence  (or  knowledge,  or  learning)  and  performance. 
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PERFORMANCE  PROGRAMS 

With  deceptive  ease  and  fluency,  Greeno  (Chapter  7,  this  volume)  has  provided 
us  with  a feast  of  performance  models  for  lessons  in  fractions  and  geometry. 
Flow  charts,  or  production  systems— choose  the  one  that  is  most  appropriate  for 
your  theory  of  a task,  and  of  what  happens  when  that  task  is  taught.  For 
cognitive  psychologists,  this  is  a powerful  and  exciting  hypothesis-generating 
exercise.  For  educators  in  search  of  a ready-made  bridge,  it  is  often  dismaying. 
What  is  the  use,  they  ask,  of  all  this  hypothetical  formalism? 

There  are  two  answers  to  that.  First,  there  is  no  other  way  of  representing  a 
complex  instructional  situation.  Believing  that  it  can  be  represented  more  simply 
is  an  illusion.  True,  we  have  not  yet  developed  a complete  instructional  science, 
but  we  are  never  going  to  develop  one  until  we  raise  our  heads  from  the  sands  of 
oversimplification.  Lik„  the  molecule,  the  gene,  and  the  galaxy,  the  human  mind 
is  very  complicated.  We  will  never  fully  understand  it  or  devise  appropriate  ways 
of  educating  it  until  we  have  a theory  that  is  detailed  enough  to  model  it.  The 
fact  that  we  have  trouble  understanding  such  theories  does  not  alter  the  fact  that 
we  need  them,  and  that  they  represent  a proper  use  of  hypothetical  formalism. 

The  second  answer  to  a “what’s  the  use  . . . ?”  question  concerns  the  ability  of 
human  beings  to  construct  respo-  programs  (Newell,  1972a).  We  need  com- 
plex models  of  the  type  Greeno  cribes  to  account  for  the  fact  that  no  two 
people  ever  respond  to  the  same  situation  in  the  same  way,  as  well  as  for  the  fact 
that  no  one  can  exact! ' duplicate  his  own  responses.  We  are  endlessly  construc- 
tive and  inventive.  N "ling  our  capacities  in  this  respect  is  an  extremely 
complicated  issue. 

Consider  the  subject  in  Shaw  and  Wilson’s  “orbiting”  card  experiment.  The 
mathematical  system  described  on  pnges  204-205  represents  an  abstract  logical 
structure.  But  clearly  it  does  not  represent  what  the  subject  is  dr<ng  mentally  as  he 
performs  the  experimental  task.  How  might  we  represent  those  mental  processes? 
The  reader  is  invited  to  select  a modeling  schema  from  Greeno’s  menu  and  apply  it 
to  the  Shaw  and  Wilson  experiment-the  point  of  the  exercise  being  to  discover 
that  subjects  exposed  to  the  generator  set  must  have  built  a different  representa- 
tion of  the  stimulus,  because  they  saw  different  stimuli.  Subjects  shown  the 
nongenerator  set  saw  hearts,  crosses,  circles,  and  squares  in  only  two  positions  on 
the  cards.  Subjects  shown  the  generator  set  saw  the  same  stimuli  in  all  four 
positions.  To  understand  the  subsequent  confusion  manifested  by  subjects  shown 
the  generator  set,  we  must  understand  how  they  represented,  or  failed  to  represent, 
the  information— compared  to  subjects  in  the  other  condition.  Mathematical  group 
theory  does  not  predict  the  construction  of  differing  mental  representations.  For 
that,  we  need  psychological  theories  (e.g.,  EPAM,  Simon  & Feigenbaum,  1964., 
Gregg  & Simon,  1967). 

In  the  case  of  Hyman’s  experiment,  what  were  the  subjects  actually  doing? 
They  were  remembering  words  in  a recognition  paradigm.  How  was  the  proto- 
type-the  Social  Worker  or  the  Lawyer-supposed  to  be  activated  during  the  task 
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itself?  This  is  not  specified,  nor  was  the  experiment  de.i'yied  to  test  such 
specifications— e.g,,  to  test  the  possibility  that  a trait  label  which  was  never  seen, 
but  which  was  generated  entirely  by  the  stereotype,  might  take  longer  to 
recognize  (signifying  a longer  sequence  of  mental  operations).  In  the  absence  of 
a performance  theory  of  this  type,  we  are  compelled  to  wonder  if  Hyman’s 
results  are  not  simply  another  manifestation  of  a fact  reported  in  1960  by 
Wishner:  that  trait  clusters  display  intercorrelations  that  are  independent  of 
personality  constructs.  The  subject’s  false  recognition  of  trait  names  may  reflect 
the  associative  structure  of  the  language  rather  than  the  psychological  structure 
of  his  mind. 

The  problem  with  the  performance  of  subjects  in  the  studies  ieported  by 
Norman,  Gentner,  and  Stevens  plagues  all  the  “networkers.”  How  do  we  account 
for  the  fact  that  the  network  was  not  infinitely  activated?  Since  every  concept  is 
potentially  related  to  every  other  concept,  what  prevents  those  relationships 
from  firing?  Somewhere,  in  a model  of  the  subject’s  performance,  there  must  be 
a task  objective,  rules  governing  the  recruitment  of  skills  and  concepts,  and  tests 
of  their  applicability  and  value. 

As  a way  of  organizing  the  issues,  consider  Fig.  2. 

Any  subject,  performing  any  experimental  task,  is  an  “assembler”  of  his  own 
knowledge,  orienting  skills,  mental  operations  (such  as  comparing  or  inferring) 
and  self-management  tactics.  To  respond  to  a task,  he  must  detect  features  of 
the  situation,  or  summon  feature  memories  of  some  kind.  He  will  (if  he  is  over 
the  age  of  2 or  1)  probably  also  use  word:  and  syntactic  rules.  He  must  call  up 
well-practiced  motor  skills,  such  as  writing  ir  talking.  The  way  he  puts  all  these 
together-his  rules  and  strategies  for  doing  io-coustitute  his  performance  pro- 


FIG.  2 The  structural  components  of  complex  behavior. 
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gram,  one  which  may  or  may  not  match  up  to  the  experimenter’s  theory  of  what 
such  a performance  program  should  entail. 

With  reference  to  the  question:  why  three  systems,  feature,  verbal,  and  action? 
The  answer  is  simply  that  we  seem  to  be  constructed  that  way.  Our  perceptual 
skills,  our  feature-noticing  skills,  are  not  the  same  as  our  linguistic  skills.  We 
cannot  substitute  percept  for  word.  Nor  can  we  substitute  percept  or  word  for 
action  and  related  physiological  phenomena.  This  design  for  humans  has  some 
theoretical  advantages  which  may  or  may  not  have  had  evolutionary  significance. 
For  example,  the  fact  that  a motor  skill  can  run  itself  off  automatically  means 
that  perception  is  freed  for  other  monitoring  operations.  The  fact  that  words  are 
separable  from  percepts  can  produce  discrepancies  that  are  important  discrimina- 
tion cues:  how  else  can  we  learn  that  all  “green  leafy  vegetables”  are  not  lettuce? 
Bruner  (Bruner,  Olver,  & Greenfield,  1966)  has  reviewed  much  additional  evi- 
dence—although  his  own  theoretical  integration  of  it  has  been  somewhat  differ- 
ent from  the  model  shown  in  Fig.  2. 

In  addition  to  these  three  systems,  we  have  a stock  of  special  programming 
operations— problem-solving  heuristics,  classification  abilities,  operations  in  the 
Piagetian  sense,  serializing  capacities,  and  so  forth.  These  are  not  conscious 
strategies,  but  are  fast,  habitual,  management  routines. 

Figure  2 is  essentially  a scheme  for  organizing  and  comparing  complex  behav- 
ior, rather  than  a theory.  It  is  especially  useful  in  the  analysis  of  pedagogical 
situations.  For  example,  consider  the  “White  Sauce  Problem”  in  Chapter  7 by 
Norman,  Centner,  and  Stevens.  What  program  was  the  student  expected  to 
assemble?  Answers  to  questions  about  French  sauces.  What  sort  of  language  was 
involved?  Labels  like  Bechamel,  Veloute,  roux,  as  well  as  familiar  words  like 
sauce,  white,  and  milk.  What  features-actual  perceptual  experiences-were  in- 
volved? None,  in  this  lesson.  (That  was,  Allen  would  no  doubt  have  suggested, 
one  of  the  problems.)  Similarly,  the  involvement  of  the  motor  systei  n-stirring, 
disintegrating  lumps,  gradual  pouring-was  missing  from  the  lesson.  Although  the 
authors  found  the  tutee’s  verbal  confusions  theoretically  interesting,  one  won- 
ders if  they  represented  anything  more  significant  than  the  fact  that  differences 
in  consistency,  coloring,  and  flavoring  among  nembers  of  the  French  sauce 
family  were  not  discriminated,  because  the  necessary  nonverbal  cues  were  never 
experienced. 


CONCLUSIONS 

Overall  then,  what  do  we  know  that  would  really  have  been  new  to  Allen?  Our 
best  theorizing  right  now— and  it  is  very  good  indeed,  as  the  chapters  in  this 
volume  reveal— is  with  reference  to  models  of  competence,  of  knowledge- 
structures.  Because  they  are  dynamic  models,  they  represent  the  human  poten- 
tial for  the  combination  and  recombination  of  ideas,  rather  that  the  static 
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architecture  of  lexicon.  But  we  are  still  not  devoting  enough  theoretical  effort  to 
the  development  of  models  of  performance.  Whether  we  are  addressing  ***  °f 
learning,  recalling,  recognizing,  generalizing,  or  evaluating-we  are  not  yet 
quaUly  modeling  what  subjects  do.  Allen  was  concerned  with  perform^  We 
are  much  better  than  he  was  in  specifying  the  characteristics  of  a well-stocke 
head.  But  he  was  still  ahead  of  us  in  specifying  the  characteristics  of  learning  an 

teaching  as  active  processes. 
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The  chapters  in  this  section  are  very  closely  related.  All  of  than  are  concerned 
with  the  application  of  ideas  of  cognitive  psychology -ideas  still  in  the 
rapid  development— to  the  problems  of  instruction.  For  example,  all  are  con- 
cerned, at  least  in  part,  with  the  processes  by  which  new  information  is  stored 
on  long-term  memory  and  with  the  implications  of  these  processes  for  instruc- 
tion In  pursuing  this  concern,  the  authors  draw  on  what  is  now  one  of  cognitive 
nsvcholoev’s  most  active  areas,  the  modeling  of  semantic  memory.  The  area 
Eludes  work  by  such  authors  as  Quillian  (1969),  Schank  (1972)  Anderson  and 
Bower  (1973),  Kintsch  (1972),  and,  of  course,  the  Norman,  Rumelhart,  an 
LNR  group  (1972),  represented  in  this  volume  by  Norman,  Gentner,  an 
Sevens  Further,  Greeno  makes  use  of  goal  structures  embodied  in  a production 
q/ErT representation  in  a manner  similar  to  recent  work  of  Newell  and  Simon 

( Them  is  necessarily  as  air  of  tentativeness  about  much  that  is  said  in  these 
chapters  because  the  authors  are  attempting  to  point  out  new  directions  rather 
than  simply  giving  better  specification  to  the  old. 

COGNITIVE  OBJECTIVES 

i find  Greeno’s  proposals  especially  exciting.  He  has  taken  of  the > , l sk  ° f 

defining  cognitive  objectives  in  education-objectives  intended  to  replace  the 
Ire  " radical  behavioral  objectives.  To  specify  a behavior  objective  fo 
instruction,  we  state  a particular  set  of  behaviors  we  want  the  students  to  be  able 
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to  perform  after  instruction,  e.g.  to  solve  a specified  class  of  arithmetic  prob- 
lems, or  to  answer  questions  about  a chapter  in  a history  text.  To  specify  a 
cognitive  objective,  we  state  a set  of  changes  we  want  the  instruction  to  bring 
about  in  the  students’  cognitive  processes,  e.g.  acquisition  of  a particular  algo- 
rithm for  division  or  the  assimilation  of  a body  of  historical  fact  to  information 
already  in  long-term  memory. 

Since  Greeno  does  not  discuss  the  relative  merits  of  cognitive  and  behavioral 
objectives  at  any  length,  it  seems  appropriate  to  do  so  here.  One  of  the  most 
important  advantages  of  cognitive  objectives  is  that  they  tend  to  focus  our 
attention  on  the  underlying  cognitive  processes  to  a greater  extent  than  do 
behavioral  objectives.  An  example  from  my  own  teaching  experience  will  illus- 
trate the  point. 

Several  years  ago,  1 was  involved  in  the  teaching  of  elementary  calculus  to  a 
group  of  college  students  who  rated  themselves  “poor”  in  mathematics  ability. 
Some  of  these  students  had  a peculiar  sort  of  difficulty  in  solving  algebra 
problems.  In  problems  where  the  task  was  to  “express  X in  terms  of  Y"  given  a 
set  of  four  or  five  relations,  e.g.  R = Z7 , X =R+  3,  etc.,  a student  might  combine 
relations  and  draw  inferences  without  apparent  pattern.  Student  performance 
improved  considerably  when  they  were  taught  some  elementary  planning  pro- 
cedures for  identifying  what  relations  were  useful  in  a given  problem  and  in  what 
order  they  should  be  considered.  The  difference,  of  course,  lay  not  in  the  answer 
that  the  student  produced  but  rather  in  the  processes  he  used  to  search  for  the 
answer. 

1 do  not  know  with  any  certainty  why  the  students  had  failed  to  learn  the 
planning  procedure  for  solving  such  problems  prior  to  college,  but  the  following 
account  is  at  least  plausible.  For  very  simple  problems  involving  just  two  or  three 
relations,  the  trial  and  error  procedure  may  work  quite  will  in  the  sense  that  the 
problem  requires  no  more  han  two  or  three  trials  before  a successful  solution 
achieved  If,  in  learning  rlgebra,  the  stucents  solved  only  short  problei.  i 
might  by  very  hard  for  the  teacher  to  determine  whether  the  students  were  ng 
trial-and-error  or  some  more  efficient  procedure.  It  would  not  be  surprising  then 
if  at  least  some  students  never  progressed  beyond  trial  and  error.  Now,  it  is  not 
impossible  in  principle  to  deal  with  this  prof  lem  using  behavioral  objectives. 
What  Oi.e  would  need  to  do  is  to  be  sure  that  .he  students  could  solve  problems 
which  resemble  in  all  important  aspects  the  problems  they  will  eventually  be 
required  to  solve.  The  difficulty  lies  in  knowing  what  the  important  aspects  are. 
It  is  exactly  at  this  point  that  knowledge  of  the  underlying  cognitive  processes  is 
important  and  that  the  superiority  of  cognitive  objectives  becomes  most  appar- 
ent. Withe  a'  in  understanding  of  the  cognitive  processes  it  is  often  very  difficult 
to  judge  just  what  sort  of  behavior  we  ought  to  require  of  our  students. 

1 encountered  another  example  illustrating  the  same  point  when  I was  doing 
research  on  the  use  of  visual  imagery  in  elementary  mathematics  (Hayes,  1973). 
One  of  my  subjects  reported  that  he  thought  of  each  of  the  digits  as  having  a set 
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of  points  that  he  used  as  “counting  points.”  For  example,  the  digit  3 had 
three  counting  points  each  located  at  one  of  the  leftward  projections  of  the 
digit,  the  digit  “4”  had  four  counting  points  also  located  at  well  specified 
positions  on  the  digit,  etc.  When  the  subject  added  a pair  of  numbers  such  as  “5” 
and  “7,”  he  would  say  “Five,”  and  then  looking  at  each  of  the  counting  points 
on  the  digit  “7”  in  turn,  say  “six,  seven,  eight, ...”  until  at  the  last  counting 
point,  he  arrived  at  the  answer  “twelve.”  When  1 asked  the  subject  how  he  had 
learned  this  method  of  adding,  he  told  me  that  he  used  to  count  on  the  digits 
with  his  pencil  until  his  teacher  forbade  him  to  do  it.  He  then  switched  to 
performing  the  same  processes  with  eye  fixations.  Clearly,  the  teacher  had 
achieved  her  behavioral  objectives  but  she  had  failed  to  have  any  important 

influence  on  the  underlying  cognitive  processes. 

The  scope  of  the  task  that  Greeno  is  tackling  is  apparent  in  the  set  of  three 
examples  he  used  as  illustrations.  The  first  example  concerns  the  understanding 
of  fractions  by  primary  school  students;  the  second,  understanding  of  psycho- 
physics at  the  college  level.  For  each  example,  Greeno  provides  a model  of  what 
he  believes  to  be  the  underlying  cognitive  processes,  drawing  as  necessary  on 
modern  analysis  of  good  structures,  perceptual  processes,  and  semantic  memory. 
The  details  of  these  models,  of  course,  may  need  to  be  modified  as  more 
knowledge  accumuhtes,  but  the  general  direction  in  which  Greeno  is  heading 

seems  just  right. 


ASSIMILATION  OF  NEW  INFORMATION 

As  we  noted  above,  all  four  of  the  authors  are  concerned,  from  various  points  of 
view  with  the  assimilation  of  new  information  into  memory.  Shaw  and  Wilson 
focus  on  the  logical  structure  of  the  information  being  stored  while  the  others 
focus  on  the  integration  of  new  knowledge  with  information  already  in  memory. 

Shaw  and  Wilson  propose  an  interesting  hypothesis  for  concept  learning.  As  I 
understand  it,  they  propose  that  a set  of  instances  will  be  sufficient  to  allow  a 
subject  to  infer  the  whole  concept  if  the  differences  between  inst-nces  imply  a 
set  of  operators  which  is  a generator  set  for  the  whole  concept.  Actuilly,  Shaw 
and  Wilson  are  not  fully  explicit  about  the  prccess  by  which  operators  are 
inferred  from  instances.  1 have  assumed  that  differences  between  instances  must 
be  important  for  that  process.  An  operator  is  a generator  for  the  whole  concept 
if,  by  applying  it  repeatedly  to  an  instance  of  the  concept,  it  will  generate  all  of 
the  instances  of  the  concept.  For  example,  suppose  that  the  concept  is  the  set  of 
four  90°  rotations  of  a square.  The  pair  of  instances 
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fa  sufficient  to  allow  the  subject  to  infer  the  whole  concept  bectiat : the 
difference  impUes  a 90°  rotation  operator  and  this  operation  is  a «ener 
the  set  of  four  rotations.  However,  the  pair  of  instances 
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process  model  for  schemas. 

DIRECT  training  of  cognitive  skills 
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Norman  says,  “ ...  if  a child  knows  how  to  learn,  then  he  can  get  the  knowledge 
by  himself ...”  and  he  asks,  “Why  do  we  not  attempt  to  teach  some  basic 
cognitive  skills  such  as  how  to  organize  one’s  knowledge,  how  to  learn,  how  to 
solve  problems,  how  to  correct  errors  in  understanding  [p.  OCX)) .” 

Resnick  says  “ ...  it  seems  likely  that  ways  can  be  found  to  make  individuals 
more  conscious  of  the  role  of  environmental  cues  in  problem  solving,  and  to 
teach  strategies  of  feature  scanning  and  analysis  ]pp.  79-80] .”  Such  instructions 
would  enhance  the  likelihood  of  their  noticing  cues  that  prompt  effective  action. 
Similar  ideas  are  being  championed  by  Papert  at  the  Massachusetts  Institute  of 
Technology. 

This  topic  is  dear  to  my  own  heart,  since  Steven  Rosenberg  and  I recently 
designed  and  taught  a course  which  was  intended  to  teach  problem-solving  skills 
to  college  students.  Our  objective  was  to  provide  a course  that  would  help 
students  to  increase  their  problem-solving  skills  by  direct  training.  In  designing 
the  course,  we  made  liberal  use  of  the  work  of  others-notably  of  Polya  (1957) 
and  Wickelgren  (1974)  on  the  teaching  of  problem  solving  and  of  Newell  and 
Simon  (1972)  on  the  analysis  of  problem-solving  skills. 

The  students  who  elected  the  course  were  distributed  through  the  four  colleges 
of  the  University.  There  were  6 students  from  Fine  Arts,  12  from  engineering, 
12  from  science,  and  12  from  humanities  and  social  science.  While  this  mix 
created  difficulty  in  finding  common  ground  on  which  to  discuss  some  topics,  it 
was  in  general  perceived  as  beneficial  both  by  us  as  instructors  and  by  the 
students  because  of  the  diversity  of  points  of  view  which  it  brought  to  class 
discussions. 

The  course  consisted  of  three  sections:  a Diagnostic  section,  lasting  about 
three  weeks,  a Theoiv-Practice  section  of  eight  weeks,  and  a Transfer  section  of 
two  weeks. 

The  Diagnostic  section  was  designed  to  serve  two  functions.  First,  it  was 
designed  to  provide  the  student  with  informal  on  about  the  current  state  of  his 
problem-solving  skills.  Second,  it  was  desigr  ed  to  teach  him  procedures  for 
examining  his  own  problem-solving  processes.  In  all,  five  techniques  were  used  to 
accomplish  these  objectives.  First,  a self-report  form  was  used  to  obtain  the 
students’  own  inventory  of  his  strengths  and  weakness  in  problem  solving. 
Second,  a problem-solving  test  including  a very  wide  variety  of  problems  (logical 
problems,  imagery  problems,  writing  problems,  etc.)  was  administered  and  each 
student  was  given  feedback  about  his  own  performance  as  compared  to  other 
students  in  the  class.  Third,  each  student  was  required  to  record  time  usage  data 
over  one-week  period  as  a means  of  assessing  work  habits.  Mean  values  for 
number  of  hours  spent  in  study  and  in  various  other  activities  were  reported  so 
that  students  could  assess  their  own  performance  against  the  group  mean. 
Fourth,  students  were  asked  to  keep  problem-solving  diaries  in  which  they 
recorded  step  by  step  accounts  of  their  own  problem-solving  behavior  on 
homework  assignments  for  later  analysis.  Fifth,  some  of  the  techniques  of 
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protocol  analysis  were  demonstrated  in  class  and  practiced  in  small  group 
sessions. 

The  theory  practice  section  consisted  of  two  major  activities  which  ran 
concuncntly-a  series  of  lectures  on  the  theory  of  problem  solving  and  the 
students’  skill  improvement  project. 

The  skill  improvement  project  was  the  most  important  single  aspect  of  the 
course.  On  the  basis  of  information  gained  during  the  diagnostic  sectio..,  the 
student  was  expected  to  identify  a skill  that  he  wanted  to  improve  and  to  design 
a project  which  would  improve  it.  Thus,  the  student  had  to  devise  a way  for 
measuring  his  initial  state  of  skill,  generate  a plan  for  improving  the  skill  which 
drew  on  materials  discussed  in  the  course,  and  measure  his  final  state  of  skill. 
Areas  chosen  included  skills  in  logical  problem  solving,  time  management,  chess, 
imagery,  memory,  and  many  others. 

A second  part  of  the  Theory-Practice  section  was  a teaching  experience.  We 
felt  that  by  teaching  a skill  that  he  understood  well,  the  student  would  be 
required  to  do  a careful  analysis  of  that  skill  in  order  to  communicate  it  to 
others.  Student  feedback  would  provide  him  with  information  about  the  ade- 
quacy of  his  analysis.  In  general,  we  found  that  the  teaching  task  worked  quite 
well  as  a pedagogical  device.  Students  reported  that  they  did  come  to  understand 
their  own  skills  better. 

The  third  part  of  the  Theory-Practice  section  was  the  course  of  lectures  which 
ran  concurrently  with  the  skill  improvement  project.  Here  we  attempted  to 
provide  breadth  rather  than  depth  on  the  theory  that  the  interested  student 
could  explore  any  topic  of  special  interest  to  him  in  greater  depth. 

For  example,  the  lecture  topics  included  the  following: 

1.  An  overview  of  problem-solving  techniques  including  trial  and  error, 
learning,  heuristic  search,  planning,  pattern  recognition,  and  a number  of 
other  methods. 

2.  A discussion  of  the  importance  of  representations  in  problem  solving, 
illustrations  of  how  changes  in  representation  can  turn  a difficult  problem 
into  an  easy  one,  and  discussion  of  the  procedures  for  constructing 
representation. 

3.  The  management  of  short-term  memory  including  demonstration  of  the 
constraints  imposed  on  problem  solving  by  the  limitations  of  short-term 
memory,  and  the  demonstration  of  techniques  for  avoiding  these  limita- 
tions. 

4.  The  importance  of  long-term  memory  (for  example,  of  factual  world 
knowledge)  for  problem  solving  and  techniques  for  storing  information. 

5.  The  nature  of  rule  induction  and  some  techniques  for  inducing  rules. 

6.  The  nature  and  use  of  hypothetical  reasoning. 

7.  Techr  ques  for  decision  making  and  the  limitations  of  the  human  as  a 
decision  maker. 
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8.  The  nature  of  planning  and  its  importance  in  problem  solving. 

9.  Perceptual  processes  and  imagery  m tasks  such  as  chess  and  mental 
arithmetic. 

10.  The  function  of  mathematical  notation  in  problem  solving. 

In  the  final  section  of  the  course,  the  students  were  asked  to  identify  what,  if 
anything,  they  had  learned  in  the  course  that  was  applicable  to  their  own  area  of 
special  interest,  and  to  report  that  learning  in  form  of  concrete  instances.  It  was 
here  that  we  hoped  that  both  we  and  the  students  would  learn  in  what  respects 
the  course  was  of  practical  use  and  in  what  respects  if  failed  in  its  l jjectives.  In 
general,  the  results  were  quite  encouraging  as  we  will  see  below. 


EVALUATION  OF  COURSE  EFFECTIVENESS 

To  evaluate  the  effectiveness  of  a course  of  this  *-rt  is  rather  difficult  since  the 
students  wore  working  in  a number  of  diverse  directions.  Nevertheless,  by 
examining  the  student  improvement  projects  and  transfer  reports,  we  cim  <n- 
tify  three  general  areas  in  which  students  report  definite  improvements  in 
problem-solving  skills. 

1 . Improvements  due  to  increased  awareness  of  own  cognitive  processes. 

2.  Improvements  due  to  increased  diagnostic  skills. 

3.  Improvements  in  generalize  problem  solving  skills. 

One  of  the  most  influential  aspects  of  increased  awareness  of  '.v  : cognitive 
processes  appeared  to  be  the  distinction  between  trial  and  error  a>  ,zt  types 
of  solution  procedure.  Four  of  the  students  commented  indepeu , r y that  the 
distinction  had  clarified  their  thinking  about  prob'em  solving.  One,  an  electrical 
engineer  said  "...  my  self  improvement  project  taught  me  that  although  it  is 
not  esthetically  pleasing  to  me,  trial  and  error  solutions  arc  sometimes  more 
efficient.  I have  used  this  knowledge  to  more  effectively  solve  problems  in  my 
fiance  and  marketing  courses.”  Ano’her  electrical  engineer  commented  “ . . . I 
have  always  used  hypothetical  reasoning  to  some  extent  but  never  realized 
exactly  what  it  was.” 

Three  of  the  psychology  students  applied  diagnostic  skills  they  had  learned  tr, 
class  to  the  process  of  analyzing  case  studies.  Reading  through  a case  study  line 
by  line,  they  recorded  their  current  judgement  as  to  what  was  important  and 
what  was  not  important  as  they  preceded.  Then,  they  read  the  case  study  again 
and  recorded  their  importance  judgements  a second  time.  They  used  the  differ- 
ence between  the  first  and  second  judgements  as  a clue  to  the  dete.minants  of 
their  clinical  judgements. 

Improvements  in  problem  solving  disc  to  generalized  problem-solving  skills 
appear  in  a number  of  forms.  A psychology  major  used  imagery  techniques  to 
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improve  her  ability  to  memorize.  Applying  her  skill  to  the  game  of  concentra- 
tion, she  became  good  enough  so  that  her  friends  refused  to  play  with  her.  More 
practically  she  applied  the  technique  to  remembering  appointments,  shopping 
lists,  learning  the  Greek  alphabet. 

Several  students  reported  gains  due  to  improvements  in  planning  and  organiza- 
tion. A physics  student  said,  “Working  problems  in  tliis  course  has  helped  me  to 
work  problems  in  physics,  not  because  they  are  the  same  types  of  problems, 
which  they  are  not,  but  because  of  the  orderly  methods  1 have  developed  as  a 
result.”  A psychology  student  reported,  “The  first  skill  I was  able  to  transfer  was 
being  able  to  clearly  identify  the  problem.  Later  the  same  student  said,  “In  both 
tasks,  I set  up  the  problem,  pointed  out  what  it  was  1 was  going  to  solve,  and 
then  in  an  organized  manner  found  out  the  details  that  were  essential  to  solve 
the  problem.”  A math  student  reported,  “Outside  of  math,  1 found  planning 
good  in  writing  papers  for  a history  class.  It  was  the  first  time  I had  ever  been 
able  to  successfully  use  an  outline  to  write  a paper.” 

A chemistry  student  reported,  “There  were  other  ways  of  improving  my  labs 
through  time  management.  For  example,  there  was  one  U.V.  spectrophotometer 
available  for  the  third  experiment  and  everyone  had  to  use  it.  By  doing  the  third 
experiment  first,  I was  the  only  one  on  the  machine  and  didn’t  have  to  wait  in 
line.”  “ ...  In  this  way,  1 finished  up  my  set  of  experiments  a week  and  a half 
ahead  of  everyone  else.” 

Six  students  reported  applications  of  knowledge  about  perceptual  processes 
and  imagery.  A music  major  analyzer  ..ie  structure  of  musical  notation  in 
relation  to  pattern  recognition  processes  in  reading  music.  A physics  major 
improved  his  ability  to  visualize  complex  molecules  in  biochemistry.  Several 
electrical  engineers  noted  the  close  analogy  between  the  recognition  of  patterns 
in  chess  and  the  recognition  of  patterns  in  analysis  of  circuit  diagrams. 

Three  students  reported  that  they  had  used  decision  making  procedures 
discussed  in  class  in  practical  application— one  to  choose  an  apartment,  one  to 
choose  a graduate  school,  and  one  to  identify  the  winner  in  a Miss  America 
contest. 

While  results  such  as  these  are  complex  and  incomplete,  and  probably  contami- 
nated by  the  fact  that  students  often  like  to  say  encouraging  things  to  their 
teachers,  we  aie,  nonetheless,  encouraged  that  our  course  really  did  help  the 
majority  of  our  students  both  by  increasing  their  knowledge  of  their  problem 
solving-processes  and  by  providing  them  with  new  problem  solving  skills. 

In  summary,  the  majoi  themes  in  the  chapters  we  have  leviewed- 
establishment  of  cognitive  objectives  for  instruction- direct  training  m cognitive 
skills-study  of  the  assimilation  of  information  to  memory  schemes-constitute 
important  directions  for  the  interaction  of  cognitive  psychology  with  instruc- 
tion. Clearly  it  will  be  many  years  before  the  details  are  woiked  out.  When  they 
are,  however,  both  cognitive  psychology  and  the  science  of  instruction  will  be 
richer  for  it. 


Part 


FUNDAMENTAL  PROCESSES 
IN  COMPREHENDING 
AND  UNDERSTANDING 
INSTRUCTIONS 


I must  confess  that  a man  is  guilty  of 
unpardonable  arrogance  who  concludes,  be- 
cause an  argument  has  escaped  his  own  in- 
vestigation, that  therefore  it  does  not  really 
exist.  1 must  also  confess  that,  though  all 
the  learned,  for  several  ages,  should  have 
employed  themselves  in  fruitless  search 
upon  any  subject,  it  may  still,  perhaps,  be 
rash  to  conclude  positively  that  the  subject 
must  therefore  pass  all  human  comprehen- 
sion. Even  though  we  examine  all  the 
sources  of  our  knowledge  and  conclude 
them  unfit  for  such  a subject,  there  may  still 
remain  a suspicion  that  the  enumeration  is 
not  complete  or  the  examination  not  accu- 
rate [Hume,  1748/1955). 
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The  main  medium  for  the  acquisition  of  knowledge  is  probably  verba!  compre- 
hension. The  central  importance  of  comprehension  skill  is  recognized  by  our 
educational  institutions,  and  therefore  comprehension  is  often  used  as  a criterion 
skill  for  measuring  achievement  and  aptitude.  In  this  chapter,  we  will  report  on 
our  investigation  of  one  aspect  of  verb?i  comprehension,  namely,  the  mental 
processes  that  underlie  sentence  comprehension.  Our  research  focuses  on  the 
information  a person  extracts  from  a sentence,  on  the  internal  representation  of 
that  information,  and  on  the  mental  orations  that  are  applied  to  the 
representation.  Our  aim  is  to  specify  the  parameters  of  the  information- 
processing system  in  simple  comprehension  tasks.  We  will  validate  our  theoreti- 
cal proposals  by  accounting  for  response  latencies  in  a task  where  people  decide 
whether  a sentence  is  true  or  false.  Then  we  will  examine  verbal  comprehension 
in  a number  of  other  tasks,  showing  how  the  same  fundamental  processes  are 
common  to  these  various  situations. 

This  chapter  consists  of  three  sections.  Fir.'t,  we  will  outline  an  information- 
processing model  that  accounts  for  response  latencies  in  verifying  simple  and 
emOvd'W  affirmative  and  negative  sentences  in  which  the  negative  sentences 
contain  the  explicit  negative,  not.  Second,  we  will  show  tin!  the  same  model 
explains  how  people  interpret  simple  instructions  that  contain  implicitly  nega- 
tive lexical  items  like  except,  different,  and  forget.  Thrd,  we  will  examine  two 
tasks  that  occur  in  educational  tests  of  verbal  comprehension-sentence  comple- 
tion and  reading  comprehension-in  order  to  show  how  performance  in  these 
tasks  can  be  analyzed  within  the  same  theoretical  framework. 

Preceiling  page  Hank 
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AN  INFORMATION-PROCESSING  MODEL 
OF  SENTENCE  VERIFICATION 

The  Internal  Representation 

Understanding  a sentence  involves  internally  representing  the  information  that 
the  sentence  contains.  It  is  likely  that  the  format  of  the  internal  representation  is 
propositional,  a relational  structure  consisting  of  a predicate  and  one  or  more 
arguments.  We  will  use  the  conventional  notation,  (, PREDICATE , ARGUMENT) 
to  denote  a proposition.  In  this  notation  part  of  the  representation  of  a simple’ 
declarative  sentence  like  77ie  dots  are  red  is  (RED,  DOTS),  meaning  redness  is 
predicated  of  the  dots.  Since  predications  can  be  affirmed  or  negated,  the  entire 
representation  of  this  affirmative  sentence  is  (AFF,  (RED,  DOTS)).  A negative 
sentence  like  The  dots  aren 't  red  is  represented  as  (NEC,  (RED,  DOTS)).  AFF 
and  NEC  are  embedding  markers  that  denote  the  affirmative  or  negative 
polarity  of  the  predication.  This  form  of  representation  allows  us  to  combine 
simple  propositions  to  represent  more  complex  sentences.  For  example.  It  is 
fortunate  that  the  dots  are  red  can  be  represented  as  (FORTUNA  TE,  P),  where  P 
is  the  simple  proposition  (AFF,  (RED,  DOTS)). 

The  internal  representation  of  a sentence  is  not  necessarily  linguistic  in  nature. 
The  verbal  symbols  in  these  representations,  for  example,  DOTS,  are  used  to 
denote  more  abstract  entities.  In  fact,  research  on  sentence-picture  verification 
suggests  that  there  may  be  a level  of  representation  that  is  neither  linguistic  nor 
pictorial  in  nature  but  can  represent  information  from  either  domain  (Chase  & 
Clark,  1972;  Clark  & Chase,  1972).  For  example  a pictur*  of  red  dots  may  be 
represented  (RED,  DOTS). 

The  detailed  form  of  the  representation  of  various  kinds  of  sentences  has  not 
yet  been  empirically  verified.  For  example,  the  research  on  the  linguistic  factors 
that  determine  the  psychological  predicate-argument  structure  has  only  begun 
(cf.  Halliday,  1967;  Hornby,  1972).  Moreover,  there  are  cases  where  various 
representations  are  formally  equivalent  and  the  selection  of  one  particular  form 
is  really  arbitrary.  Nevertheless,  this  conventional  notation  is  sufficient  for  the 
current  model  and  promises  to  be  flexible  enough  to  accommodate  a variety  of 

linguistic  structures  (cf.  Kintsch,  1972). 

The  same  sentence  may  be  represented  differently  in  different  situations.  This 
follows  from  the  resumption  that  the  representation  contains  the  information 
that  a person  ex’n.cts  from  a sentence.  What  information  is  extracted  depends 
on  the  preceding  sentences,  the  situation  in  which  the  sentence  is  embedded,  and 
the  listener’s  previous  knowledge.  The  role  that  context  plays  in  how  a sentence 
is  represented  suggests  an  important  distinction  between  the  psychological 
notion  of  an  internal  representation  and  the  traditional  linguistic  notion  of  deep 
structure. 
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The  Task 

We  have  recently  proposed  a model  to  account  for  the  mental  processes  under- 
lying the  verification  of  affirmative  and  negative  sentences  (Carpenter  & Just, 
197S).  The  situation  that  originally  gave  rise  to  the  model  is  a simple  task  in 
which  a person  must  decide  whether  a sentence  is  true  or  false  of  a picture.  For 
example,  Just  and  Carpenter  (1971)  presented  sentences  li'.e  The  dots  are  red  or 
The  dots  aren  7 red,  as  well  as  pictures  of  red  dots  or  of  , lack  dots.  Thus,  the 
sentences  could  be  affirmative  or  negative,  and  true  or  false.  A person  was  first 
shown  the  picture,  and  then  timed  as  he  or  she  read  the  sentence  and  decided 
whether  it  was  true  or  false.  The  results  of  this  study  showed  that  it  took  longer 
to  verify  negative  sentences  than  affirmative  sentences  by  a certain  amount  of 
time,  called  negation  time.  In  this  particular  study,  the  negation  time  was  a little 
more  than  two-fifths  of  a second.  The  study  also  showed  that  affirmative 
sentences  were  verified  faster  when  they  were  true  than  when  they  were  false, 
while  negative  sentences  were  verified  faster  when  they  were  false.  The  diffe: 
ence  in  verification  time  between  the  true  and  false  sentence  was  opposite  in  sign 
but  equal  in  magnitude  for  affirmative  and  negative  sentences.  This  time,  called 
falsification  time,  was  a little  more  than  one  fifth  of  a second  in  this  study. 
These  two  results,  the  latency  advantage  of  affirmative  sentences,  as  well  as  the 
interaction  between  affirmative-negative  and  true-false,  vere  also  found  in  a 
number  of  previous  studies  (Chase  & Clark,  1972;  Clark  & Chase,  1972;  Gough, 
1965,  1966;  McMahon,  1963;Trabasso,  Rollins,  & Shaughnessy,  1971 ; Wason  & 
Jones,  1963). 


The  Mental  Operations 

In  this  section  we  will  outline  a model  that  accurately  predicts  the  verification 
times  for  these  simple  affirmative  and  negative  sentences.  We  will  show  that  it 
also  accounts  for  the  verification  of  embedded  sentences.  Moreover,  the  main 
features  of  the  model  will  serve  as  a basis  for  our  examination  of  con. prehension 
processes  in  instructional  situations. 

The  main  focus  of  the  model  is  on  the  operations  that  compare  the  sentence 
and  picture  representations.  The  model  postulates  that  the  corresponding  con- 
stituents from  the  two  representations  are  retrieved  and  compared,  pair  by  pair. 
Moreover,  the  number  of  these  retrieve  and  compare  operations  is  assumed  to  be 
the  primary  determinant  of  the  pattern  of  verification  times.  Figure  1 shows  a 
flow  chart  for  the  proposed  process.  The  prepositional  structure  of  the  represen- 
tations provides  an  ordering  relation  on  the  constituents.  This  ordering  deter- 
mines the  sequence  in  whxh  constituents  are  compared.  Inner  propositions  are 
compared  before  polarity  markers.  An  AFF  marker  in  a sentence  representation 
is  assumed  to  match  the  absence  of  a marker  in  the  picture  representation,  since 


248 


M.  A.  JUST  AND  P.  A.  CARPENTER 


pictures  are  generally  encoded  affirmatively.  The  “find  and  compare”  process  is 
a serial,  iterative  operation  that  can  be  applied  to  representations  with  multiple 
embeddings.  This  iterative  operation  will  allow  the  model  to  be  generalized 
without  additional  assumptions. 

The  central  assumption  is  that  whenever  two  corresponding  constituents  from 
the  sentence  and  picture  representations  mismatch,  then  the  entire  comparison 
process  is  reinitiated.  To  prevent  the  process  from  looping  forever  on  mismatch- 
ing constituents,  we  assume  that  the  first  time  a mismatch  is  discovered,  the  two 
constituents  involved  are  tagged,  so  that  on  subsequent  recomparisons  the  two 
will  be  treated  as  a match. 

Since  mismatches  cause  the  comparison  process  to  be  reinitiated,  the  total 
number  of  comparison  operations,  and  consequently  the  total  latency,  increases 
with  the  number  of  mismatches.  Moreover,  a mismatch  that  occurs  later  in  the 
comparison  process  results  in  more  recomparisons  than  a mismatch  on  earlier 
constituents.  Thus,  the  total  latency  is  a function  of  both  the  number  of 
mismatches  and  their  locus  in  their  respective  representations. 

A response  index  records  the  matches  and  mismatches  between  constituents. 
The  ir  dex  has  two  possible  states,  true  and  false.  At  the  beginning  of  each  trial, 
its  initial  state  i'»  true,  and  each  mismatch  causes  it  to  change  its  state.  The  time 
spent  in  changing  the  response  index  (and  tagging  mismatching  constituents)  is 
assumed  to  be  negligible  relative  to  the  time  to  perform  the  find  and  compare 
operation. 
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TABLE  1 

Rap  mentations  and  Predictions  for  the 
Four  Information  Conditions® 


True  affirmative 

False  affirmative 

Sentence: 

Picture: 

Sentence  representation: 
Picture  representation: 

The  dots  are  red. 

Red  dots 

(AFF,  (RED,  DOTS)) 
(RED, DOTS) 

+ + 

The  dots  are  red. 

Black  dots 

(AFF,  (RED,  DOTS) ) 
(BLACK,  DOTS) 
- index  = false 

response  = true 
k comparisons 

+ + 
response  - false 
k + 1 comparisons 

False  Negative 

True  Negative 

Sentence: 

Picture: 

Sentence  representation: 
Picture  representation: 

The  dots  aren’t  red.  The  dots  aren  t red. 

Red  dots  Black  dots 

(NEC,  (RED,  DOTS) ) (NEC,  (RED,  DOTS) ) 
(RED,  DOTS)  (BLACK,  DOTS) 

+ index -false  - index  = false 

+ + + index  = true 

response  = false 
k * 2 comparisons 

+ + 
response  = true 

“Plus  and  minus  sign:,  denote  matures  miu  mum—  - ' . ... 

constituents.  Each  horizontal  line  of  plus  and  minus  signs  indicates  a reinitia- 
tion of  the  comparison  process. 

WVn  the  model  is  applied  to  the  proposed  sentence  and  picture  represen- 
tations in  the  Just  and  Carpenter  experiment,  it  can  account  for  the  latencies  in 
the  four  conditions.  In  the  simplest  case,  the  true  affirmative  there  are  no 
mismatches  between  the  sentence  and  picture  representations,  as  shown  m Table 
1.  The  first  compr.ison,  between  the  inner  propositions,  results  in  a match,  lhe 
second  compariscn,  between  polarity  markers,  also  results  in  a match  (Recall 
that  AFF  markei  in  the  sentence  representation  is  presumed  ti  match  e 
absence  of  any  polarity  marker  in  the  picture  representation).  Thus  if  ter  a i tote 
of  two  constituent  comparisons,  the  truth  index  is  still  set  to  true , an  s 
response  is  executed.  The  number  of  constituent  comparisons  in  the  true 
affirmative  case  serves  as  the  base  line  for  the  other  conditions,  and  will  be 
referred  to  as  k.  Here,  k equals  2. 

In  the  false  affirmative  condition,  the  inner  propositions  of  the  sente  ice  and 
picture  mismatch.  The  mismatch  will  reinitiate  the  comparison  process,  causing 
one  extra  comparison  above  the  base  number.  Table  2 shows  the  consequences 
of  this  mismatch  in  detail.  The  mismatching  constituents  are  tagged  and  ihe 
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TABLE  2 

A Trace  of  the  Operations  in  Verifying  a False  Affirmative 

Stimulus  sentence:  The  dots  are  red. 
Stimulus  picture:  A set  of  black  dots 


Operations 


Initialize  response  index  to  true 
Represent  sentence 
Represent  picture 

1.  Compare  fust  constituents 

Tag  sentence  constituent 
Tag  picture  constituent 
Change  index  to  false 
Reinitialize  comparison  process 

2.  Compare  fust  constituents 

2. Compare  second  constituents 
Respond  with  content  of  index 
Number  of  comparisons 


(AFF,(RtD,  DOTS)) 
(BLACK,  DOTS) 

(AFF,  ( M )) 

( M ) 


+ 


+ 

False 

k + 1 , where  k = 2 


response  index  is  set  to  false.  After  the  reinitiation,  the  tagged  inner  onstitue its 
are  compared,  and  they  match.  The  next  comparison,  between  the  po  anty 
markers  also  results  in  a match.  So,  the  response  false  is  executed  after  a total 

k + 1 constituent  comparisons.  , . 

In  the  false  negative  condition,  there  will  be  a total  of  k + 2 comparisons,  d 
to  the  mismatch  on  the  second  constituent,  the  polarity  markers.  This  mismatch 
wil  cal  the  response  false  to  be  executed.  For  the  true  negative  cond.tmn, 
both  the  first  and  the  second  constituents  mismatch,  so  that  response 
will  be  executed  after  a total  of  k + 3 constituent  comparisons.  Both 
are  summarized  in  the  bottom  half  of  Table  1. 

The  model  postulates  that  verification  latencies  should  be  a direct  function  o 
J number  of  constituent  comparisons.  The  number  of  comparisons  and  hence 
the  latency,  should  increase  linearly  from  true  affirmative  (k  , to  false  a^firma- 
..  (V  + n m false  neaatives  (k  + 2),  to  true  negatives  (k  + S). 

T , '1  of i well  as  .he  bee,  flu*  1 «*  » 
shown  in  Fig.  2.  The  predictions  of  the  model  fit  the  data  quite  well.  The  model 
accounts  for  98.0%  of  the  variance  among  the  four  means.  The  s ope  is  m 

^0  nly1  S<i>  n e 6p  a i a me  teT,  he  time  to  find  and  compare  a pair  of  constituents,  is 
necessary  to  characterize  the  processing  in  these  four  conditions,  w ere 

(Carpenter  & Just,  1975),  we  1^ 

and  shown  that  it  can  also  account  for  the  latencies  in  y 
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(TA)  (FA)  (FN)  (TN) 

NUMBER  OF  CONSTITUENT  COMPARISONS 

FIG.  2 The  fit  of'the  constituent  comparison  model  for  the  four  information  conditions. 
(Data  from  Just  &.  Carpenter,  1971,  Exp.  II). 

experiments  (i.e.,  Clark  & Chase,  Expts.  I,  II,  & III,  1972;  Gough,  1965,  1966, 
Expt.  II;  McMahon,  1963,  Trabasso  et  at.,  Expt.  IX,  1971).  Thus,  the  current 
model  provides  a parsimonious  explanation  of  performance  in  these  tasks. 
Although  parsimony  is  desirable,  the  model  should  be  evaluated  on  the  basis  of 
its  ability  to  offer  a rigorous  formulation  that  is  both  a plausible  mental  process 
and  can  incorporate  a wide  variety  of  empirical  results.  This  ability  will  be 
demonstrated  in  each  of  the  following  sections.  The  nature  of  the  propositional 
representation,  particularly  the  embedding  feature,  should  allow  the  model  to  be 
generalized  to  more  complex  sentences  without  additional  modifications.  This 
property  of  the  model  was  tested  by  examining  embedded  affirmative  and 
negative  sentences. 


A Test  of  the  Model 

To  further  test  the  model,  the  serpe  of  the  negative  was  systematically  varied. 
Other  factors,  like  the  sentence  length  and  the  picture,  were  kept  constant.  The 
scope  of  a negative  is  defined  as  the  range  of  constituents  to  which  it  applies 
(Jackendoff,  1969;  Klima,  1964).  The  affirmative  sentences  used  in  the  experi- 
ment included  the  superordinate  clause  It  is  true  that . . . (e.g.,  ft  is  true  that  the 
dots  are  red ) and  could  be  negated  in  two  ways.  With  one  type  of  negation,  the 
negative  has  a small  scope,  namely  the  inner  predication:  It  is  true  that  the  dots 
aren’t  red.  This  will  be  called  predicate  negation.  The  second  type  of  negation 
has  a larger  scope  since  the  negative  is  in  the  superordinate  clause  where  it 
applies  to  the  entire  inner  proposition:  It  isn't  true  that  the  dots  are  red.  This 
type  of  negation  will  be  called  denial.  Denials  should  take  longer  to  process  than 
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, occur  on  a constituent  that  is 

predicate  negatives  because  the  mism  d exarnining  the  hypothe- 

compared  later.  The  exact  prediction  can  with  the 

sized  representations  for  these  sentence  types  and 

comparison  process  lit.  true  that  the  dots  are 

The  representation  of  an  affir “"TtS dotlZed.  namely  ,(AFF, 
red  may  be  the  same  as  for  t e S“"P  b dding  clause  It’s  true...  does  not 
(RED,  DOTS))  The  rationale  is  thatthee  g ^ this  point, 

change  the  truth  value  and  so  it  ^ ^ e /f-s  true  that  it’s  true  that 

consider  a concatenation  of  this  W ’ embedding  clauses  is  irrelevant 

it’s  true...  the  dots  are  red.  The  number  JUmg  affirmative 

to  the  truth  valU^°d  | ^/^'the' representation  of  a predicate  negative 
proposition  may  be  deleted  irom  v representation  would 

sentence  like  It’s  true  that  the  dots  ,P  den,al  like  It  isn’t 

be  (NEC,  (RED.  DOTS)).  embedded  clause.  Here  the 

true  that  the  dots  are  red  must  includ  B Qf  ^ sentence.  Thus,  denial 

embedding  clause  does  affect -the  DOTS))).  The  pictures 

sentences  might  be  represented  Idee  (NEC,  (A  ^ ooTS)  or  (BLACK, 

saw 

true  or  false  of  the  picture.  There  were  derjved  by  examining  the  flow  chart 

The  predicitions  of  the  model  « predicted  number  of 

model  in  Fig.  1 and  the  ^ . f2«  ^ 

operations  necessary  to  ven  y a predicate  negative  it  is  * + 

+ i ; for  a false  predicate  -f  ^ ^^^inial  L k ♦ 5.  The  verification 
3;  for  a false  denial  it  is  * + 4 a ed  number  of  operations.  A 

latencies  should  mcrease  hneariy  with  P ^ wi„  constitute  strong 

- - — °f  a s,ngle 

With  the  number  of  hypothesize  for  each  additional  constituent 

latencies  increased  an  averaf  ° JJ  ‘figure  3 shows  this  result,  along  with 
comparison  (Standard  Error  - 23  ms  f for  97.7%  of  the  variance 

the  best  fitting  straight  line.  p > .01.  The  residual  2.3%  is  not 

among  the  six  means,  F(  > ’ n ared  deviation  (RMSD)  of  52 

significant,  F( 4,  1 15)  = 1 . parameter.  This  analysis  confirms  the 

msec  is  small  relative  to  the  200  ms  P with  the  number  of 

major  hypothesis  that  verification  ,me  increases 

constituent  comparisons  correlated  with  the  latencies  (r  = 

The  error  , rate  for  the  » “ indict*  ttat  the  probability  of  mor 

.98),  as  is  shown  m rig.  inis  cu 


TABLE  S 

Representation*  end  Predictions  for  tfe  Six  Information  Conditio ns“ 


True  affirmative 

False  affirmative 

Sentence: 

It’s  true  that  the  dots  are  red. 

It’s  true  that  the  dots  are  red. 

Picture: 

Red  dots 

Black  dots 

Sentence  representation: 

(AFF,  (RED,  DOTS)  ) 

(AFF,  (RED,  DOTS) ) 

Picture  representation: 

(RED,  DOTS) 
+ ♦ 

(BLACK,  DOTS) 

- index  - false 

response  = true 

+ + 

k comparisons 

response  = false 
k + 1 comparisons 

False  predicate  negative  True  predicate  negative 


Sentence: 

It’s  true  that  the  dots  aren’t  red. 

It’s  true  that  the  dots  aren’t  red. 

Picture: 

Red  dots 

Black  dots 

Sentence  representation: 
Picture  representation: 

(NEG,  (RED,  DOTS)  ) 
(RED,  IXTS) 

-*•  index  = false 

(NEG,  (RED,  DOTS) ) 
(BLACK,  DOTS) 

- index  = false 

+ + 

+ index  = true 

response  = false 

+ + 

it  + 2 comparison 

response  ■ true 
k + comparisons 

False  denial 


True  denial 


Sentence: 

Picture: 

Sentence  representation: 
Picture  representation: 


It  isn’t  true  that  the  dots  are  red. 
P.ed  dots 

(NEG,  (AFF,  (RED,  DOTS) ) ) 
(RED,  DOTS) 

+ + index  = false 

+ + 

response  = false 
it  + 4 comparisons 


It  isn't  true  that  the  dots  are  red. 
Black  dots 

(NEG,  (AFF,  (RED,  DOTS) ) 
(BLACK,  DOTS) 

- index  = Use 
_ + + index  = true 

+ + + 

response  true 
k + 5 comparisons 


optus  and  minus  signs  denote  matches  and  mismatches  of  the  corresponding  -nstituents.  Each 
horizontal  line  of  plus  and  minus  signs  indicates  a reinitiation  of  the  comparison  process. 
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NUMBER  OF  CONSTITUENT  COMPARISONS 


FIG.  3 The  fit  of  the  constituent  comparison  model  for  the  six  information  conditions. 


increases  with  the  number  of  hypothesized  operations.  This  suggests  that  the 
probabilities  of  error  in  the  comparisons  are  additive. 

The  model  is  able  to  predict  the  processing  time  for  these  six  conditions  on  the 
basis  of  a single  parameter:  the  time  to  find  and  compare  a pair  of  constituents. 
These  results  strongly  support  the  hypothesis  that  a single  iterative  operation 
accounts  for  the  processing  of  affirmative  and  negative  sentences.  The  embedded 
representation,  combined  *.  ith  the  iterative  comparison  operation,  allow  the 
model  to  account  for  the  no  scopes  of  negation  without  additional  assump- 
tions. 

A further  control  study  showed  that  the  representation  and  processing  of  the 
sentence  is  determined  by  its  semantics  rather  than  by  its  surface  structure.  To 
show  this,  we  compared  the  processing  of  sentences  that  had  the  same  con- 
stituent structure  but  different  surface  structures.  In  this  control  study,  the 
inrer  propositions  of  the  sentences  were  embedded  in  two  ways:  the  same  way 
as  the  previous  experiment  (e.g.,  It’s  true  that  the  dots  aren’t  red)  and  with  the 
embedding  clause  at  the  end  of  the  sentence  (e.g..  That  the  dots  aren’t  red  is 
true).  Both  of  these  kinds  of  sentences  are  postulated  to  hav  the  same  con- 
stituent structure.  However,  the  position  of  the  negative  in  the  surface  structure 
has  been  changed.  If  the  results  of  the  basic  experiment  can  be  replicated  with 
the  new  sentences,  then  the  results  cannot  be  due  to  position  of  the  negative 
morpheme  in  the  surface  structure.  The  new  stimulus  sentences  were:  Affirma- 
tive- That  the  do's  are  red  is  true;  Predicate  negative- 77w/  the  dots  aren 't  red  is 
true;  and  Denial-  That  the  dots  are  red  isn’t  true. 
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Ihu  control  study  showed  the  two  types  of  surface  structures  were  processed 
similarly.  Both  types  of  sentences  showed  a .near  increase  in  latency  as  the 
.lumber  of  comparisons  increased.  Regardless  of  whether  the  negative  morpheme 
occurred  near  the  beginning  or  end  of  the  sentence,  denials  took  about  500  msec 
lo/iger  to  verify  than  predicate  negatives.  This  rtsult  shows  that  t?ie  underlying 
constituent  structure  rather  than  order  of  negatives  in  the  surface  structure 
determines  processing  time,  and  constitutes  further  support  for  the  proposed 
representations  for  the  two  kinds  of  negative  sentences. 

The  mental  processes  described  by  this  model  ate  not  specific  to  the  sentence- 
verification  paradigm,  but  occur  in  a large  number  of  situations  that  involve 
verbal  comprehension.  These  more  general  processes  involve  relating  the  infor- 
mation from  a sentence  to  information  from  a second  sovree.  such  as  the 
listener’s  previous  knowledge  of  the  world.  For  example,  in  order  to  agree  or 
disagree  with  a statement,  it  is  necessary  to  compare  the  statement  to  a 
representation  of  one’s  own  belief.  In  order  to  answer  a Wh  question  (e.g.,  K'bo 
painted  the  fence),  the  information  provided  in  the  question  (e.g.  rhat  someone 
painted  the  fence)  must  be  compared  to  previous  knowledge  before  the  interro- 
gated constituent  can  be  retrieved.  In  order  to  acquire  new  information  through  a 
verbal  medium,  the  old  information  in  the  communication  will  serve  as  a basis  to 
which  the  new  information  is  added.  The  determination  of  which  information  is 
old  can  only  be  nude  if  the  sentence  representation  is  compared  to  previous 
knowledge.  In  the  next  section  of  this  chapter,  we  vnll  show  that  these  compari 
son  operations  also  occur  when  we  follow  simple  instruction.  Thus,  the  basic 
kinds  of  operations  described  by  the  model  are  part  of  a large  class  of  compari- 
son operations  that  occur  very  commonly  when  we  comprehend  linguistic 
material. 

The  mental  operations  dearth*  d by  the  model  are  not  specific  to  the  process- 
ing of  explicitly  negative  sente  ces.  but  rather  they  occur  in  the  processing  of  a 
variety  of  semantic  structures,  blsewhere,  we  ha'T  shown  how  the  model 
accounts  for  semantic  structures  such  »s  negative  quantifiers  like  few.  paiticular 
and  universal  quantifiers  like  some  and  all  (Just,  1974),  counterfaclual  clauses 
like  Mary  would  has*  left . . (Carpenter,  1973),  and  active  and  passive  sen- 
tences like  The  car  hit  the  truck  and  The  truck  was  hit  bv  the  car  {cf  Carpenter 
A Just.  1975).  Next  we  will  show  how  the  model  also  accounts  for  tlie 
processing  of  inst'uctions  that  contain  implicitly  negative  predicates. 


COMPREHENDING  IMPLICITLY  NEGATIVE  INSTRUCTIONS 

A number  of  predicates  like  forget,  thoughtless,  disagree,  and  absent  arc  consid- 
ered implicitly  negative  (cf  Clark,  in  press,  Just  St  Carpenter,  1971  Klima, 
1964)  For  example,  we  may  define  forgot  as  did  i't  remember  or  we  may  think 
of  absent  as  not  present,  and  so  on  By  contrast,  we  do  not  generally  think  of 
remembered  as  didn't  forget  This  suggests  that  there  may  be  an  asvmmetry  in 
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how  we  internally  represent  pairs  of  lexical  items  like  remember  and  forget; an 
implicitly  negative  item  like  forget  may  be  internally  represented  as  a negation 
of  remember.  This  hypothesis  can  be  tested  by  examing  the  data  from  a number 
of  comprehension  studies  that  have  used  such  implicitly  negative  predicates. 
Two  types  of  studies  provide  relevant  data.  The  fait  type  involves  sentence 
verification  tasks  where  the  stimuli  contained  implicitly  negative  predicates.  In 
the  second  type  of  study,  the  implicit  negatives  were  in  the  instructions  given  to 
the  subject.  If  predicates  like  forget  are  represented  as  negatives,  then  their 
processing  should  conform  to  the  constituent  comparison  model 


Renriecnber-Forget 

The  implicitly  negative  predicate  forget  to  presents  an  interesting  opportunity 
for  examining  the  comprehension  of  negation.  Not  only  is  this  predicate  nega- 
tive, but  the  proposition  embedded  in  it  is  also  negative.  For  example,  the 
sentence  John  forgot  to  let  the  dog  out  directly  implies  that  John  did  not  let  the 
oog  out  (Karttunen,  1971).  Thus  one  can  study  how  people  extract  information 
from  the  implications  of  implicitly  negative  predicates.  In  a study  by  Just  and 
Gark  (1973,  Expt.  II),  subjects  were  presented  with  an  affirmative  sentence 
(John  remembered  to  let  the  dog  out)  or  an  implicitly  negative  one  (John  forgot 
to  let  the  dog  out)  and  then  were  timed  as  they  verified  the  probe  sentence  (e.g.. 
The  dog  is  in)  u true  or  false  of  the  implication  of  the  parent  sentence.  The 
relevant  information  from  a sentence  like  John  forgot  to  let  the dog* out  is  that 
the  dog  is  not  out.  This  may  have  been  represented  as  (MEG,  (OUT,  DOG)  ) . The 
infon.tation  from  a sentence  with  remembered  to  would  be  represented  as  (AFF, 
(OUT  DOG) ).  This  sentence  representation  would  be  compared  to  a representa- 
tion of  the  probe,  like  The  dog  is  in,  represented  (AFF,  (IN,  DOG)  ).  The  model 
predicts  that  verification  latencies  should  increase  linearly  from  true- 
remembered,  to  (Use -remembered,  to  fz\x-forgot,  to  tru e-forgot.  Jbe  data 
conform  very  nicely  to  predictions  of  the  model,  which  accounts  for  94.6%  ot 
the  variance  among  the  four  conditions,  as  shown  in  Table  4.  This  result  shows 
that  the  implications  in  implicit  negatives  are  processed  similarly  to  explicitly 
negative  sentences. 


Present- Absent 

A similar  kind  of  verification  task  provides  evidence  that  indicates  absent  is 
internally  represented  as  a negation  of  present.  Sentences  like  The  star  is  p~esent 
or  The  star  is  absent  were  verified  against  pictures  that  e-.'her  contained  a star  (*) 
or  a plus  (+),  Clark  (in  press)  If  absent  is  internally  represented  as  a negative, 
then  a sentence  like  The  star  is  absent  might  be  represented  as  (A  EG, 
(PRESENT,  STAR)).  This  representation  would  be  compared  to  the  representa- 
tion of  the  picture,  either  (PRESENT,  STAR)  or  (PRESFNT,  PLUS),  in  this 
experiment.  These  representations  can  be  used  to  generate  the  predictions  of  the 


258  M.  A.  JUST  AND  P.  A.  CARPENTER 


model.  The  latencies  should  increase  linearly  from  hue— present,  to  false- 
present,  to  (ahe-absent,  to  tru e-absent.  As  Table  4 shows,  the  model  accounts 
for  94.8%  of  the  variance  among  the  four  means,  with  an  estimated  169  ,isec 
per  constituent  comparison.  Thus,  the  results  support  the  hypothesis  that  in  this 
task,  absent  is  interpreted  as  an  implicit  negative.  Moreover,  the  quantitative 
relations  among  the  four  latencies  support  the  idea  that  theie  is  a serial  retrieval 
and  comparison  of  constituents  from  the  internal  representations  of  the  sentence 
and  picture. 

The  next  several  experiments  are  tasks  in  which  the  instructions  contained 
implicit  negatives.  We  will  show  that  the  comparison  process  postulated  by  the 
model  for  sentence  verification  also  explains  how  people  understand  and  execute 
simple  instructions. 

Same-Different 

One  experiment  that  used  negative  instructions  involved  comparing  a word 
(either  the  word  circle  or  square)  to  a picture  (of  either  a circle  or  a square) 
(Seymour,  1969).  One  groun  if  subjects  was  given  an  affirmative  instruction; 
they  were  asked  to  respond  “yes”  if  the  word  and  the  picture  had  the  same 
meaning  and  to  respond  no  otherwise.  Another  group  of  subjects  was  given  a 
negative  instruction:  respond  “yes”  if  the  word  and  picture  arc  different,  and 
“no”  otherwise.  The  instruction  involving  the  predicate  different  may  have  been 
represented  with  a negative:  respond  “yes”  if  the  picture  is  not  the  same  as  the 
word— (ATTG,  (IS,X)),  where  the  symbol  X takes  the  value  denoted  by  the  word. 
For  example,  when  the  word  was  circle,  it  would  be  coded  into  the  instruction 
as  (NEC.  (IS,  CIRCLE)).  Then,  this  representation  would  be  compared  to  the 
pic.ure  representation.  If  the  picture  was  a square,  the  comparison  between  the 
representation  of  the  instruction  and  the  representation  of  the  picture,  (IS, 
SQUARE. ),  would  result  in  a “yes”  response.  If  the  instruction  were  compared 
to  a picture  of  a circle,  the  comparison  would  result  in  a “no”  response.  Each 
time  an  instruction  with  different  is  executed,  the  subject  is  essentially  process- 
ing a negative  construction.  Therefore,  the  model  predicts  that  latencies  should 
increase  linearly  from  “yes "-same,  »o  “no”-same,  to  "no" -different,  to 
"yes" -different.  As  Table  4 shows,  the  model  accounts  for  98.8%  of  the 
variance  among  the  four  means,  with  a slope  o!  82  msec  per  constituent 
comparison.  This  supports  the  hypothesis  the  different  is  internally  represented 
as  an  implicit  negative.  Moreover,  ihe  results  show  that  the  mental  processes 
involved  in  executing  instructions  can  be  explained  by  the  model  for  sentence 
verification. 

Agree-Conflict 

A very  similar  experiment  by  Trabasso etal.  (Expt.  X,  197 Dean  be  analyzed  to 
test  whether  the  predicate  conflict  is  internally  represented  as  a negative  The 
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task  was  to  compare  a word  (either  orange  or  green)  to  a picture  that  was 
colored  either  orange  or  green.  One  group  of  subjects  was  given  an  affirmative 
instruction,  judge  whether  or  not  the  word  and  the  picture  agree.  Another  group 
was  given  an  implicitly  negative  instruction:  judge  whether  or  not  the  word  and 
picture  conflict.  The  instruction  involving  conflict  might  be  represented  as 
(NEG,  (IS,  X)),  where  the  symbol  X would  take  the  value  of  the  coior  word 
presented  during  the  trial.  For  example,  suppose  the  word  orange  were  pre- 
sented; it  would  be  coded  into  the  instruction  as  (NEG,  (IS,  ORANGE ))  and 
then  compared  to  the  picture.  If  the  picture  were  colored  green,  the  comparison 
between  the  representation  of  the  instruction  and  the  representation  of  the 
picture,  (IS,  GREEN),  would  result  in  a response  of  “yes.”  If  the  instruction 
were  compared  to  an  orange  picture,  it  would  result  in  a response  of  “no.”  These 
representations  lead  to  the  prediction  that  the  latencies  should  increase  linearly 
from  “yes "-agree,  to  “no "-agree,  to  “no" -conflict,  to  “yes '-conflict.  As 
Table  4 shows,  the  model  accounts  for  99.4%  of  the  variance  among  the  four 
means,  with  an  estimate  of  109  msec  per  constituent  comparison.  This  supports 
the  hypothesis  that  the  processes  involved  in  following  simple  instructions  with 
agree  ana  conflict  are  the  operations  of  representing,  retrieving,  and  comparing 
constituents. 

Synonymous-Unrelated 

The  lexical  ;'.em  unrelated  may  also  je  represented  as  an  implicit  negative.  To 
test  this  hypothesis,  Hayden  and  Clark  asked  people  to  judge  the  semantic 
relation  between  two  words  that  had  the  same  meaning  (e.g.,  urge  and  big)  or 
different  meanings  (e.g.,  large  and  tidy)  (reported  by  Clark,  in  press).  One  group 
of  subjects  was  given  an  affirmative  instruction:  judge  whether  or  not  the  two 
words  are  synonymous.  Another  group  was  given  an  implicitly  negative  instruc- 
tion: judge  whether  or  not  the  two  words  are  unrelated.  The  instruction  with 
unrelated  may  have  been  represented  like  the  implicit  negatives  different  and 
conflict:  (NEG,  (MEANS,  X)),  where  X takes  on  the  value  of  one  of  the  two 
words  presented  in  a trial.  For  example,  suppose  the  pair  (large-tidv)  were 
presented.  The  first  word  might  be  coded  into  the  instruction,  (NEG,  (MEANS, 
LARGE)).  The  second  word  would  be  coded  as  (MEANS,  TIDY)  and  then 
compared  to  the  representation  of  the  instruction.  In  this  case,  the  response 
would  be  “yes.”  If  the  second  word  were  big,  the  response  would  have  been 
“no.”  The  model  predicts  that  latencies  should  increase  linearly  from  “yes”- 
synonymous,  to  “no" -synonymous,  to  “no” -unrelated,  to  “yes" -unrelated. 
As  Tab  e 4 shows,  the  model  accounts  ivH  >^.8%  of  the  variance  among  the 
means,  with  an  estimate  of  145  msec  per  consti  uent  comparison.  This  supports 
the  hypothesis  that  unrelated  is  represented  am  executed  as  a negative  instruc- 
tion. As  in  the  cases  of  different  and  conflict  ihe  negative  item  unrelated  takes 
longer  to  execute  and  causes  the  “yes”  response  to  take  longer  than  the  "no” 
response. 
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Except 

In  a very  different  kind  of  task,  Jones  (1966a,  b,  1968)  examined  how  people 
execute  instructions  that  contain  the  implicitly  negative  word  except.  In  these 
tasks,  people  would  read  an  affirmative  instruction  like  “Cross  out  the  numbers 
J » 3,  4,  6,  7,  or  a negative  instruction  like  “Cross  out  all  the  numbers  except  2, 
5,  8.”  Then  they  were  given  a sheet  that  was  filled  with  digits  1 to  8 in  random 
order  and  timed  while  they  performed  the  task.  The  two  instructions  require  the 
same  overt  responses;  but  if  except  is  a negative,  the  two  instructions  will  cause 
very  different  mental  operations.  We  hypothesized  that  the  instruction  with 
except  is  represented  as:  (NEG,  (IS,  2 or  5 or  8)).  Each  digit  encountered  on  the 
page  would  be  represented  (IS,  X),  where  X takes  the  value  of  the  digit.  The 
digit  w:ll  be  crossed  out  if  there  are  two  mismatches  between  the  two  representa- 
tions. This  would  happen  if  X took  the  value  7,  for  example.  By  contrast,  the 
affirmative  instruction  would  be  represented  (AFF,  (IS,  I or  3 or  4 or  6 or  7 )). 
Each  digit  encountered  on  the  page  would  be  represented  as  (IS,  X)  so  the  digit 
would  be  crossed  out  if  there  were  no  mismatches.  The  negative  instruction  took 
a significant  13  sec  longer  to  execute,  and  resulted  in  significantly  more  false 
positive  errors  (crossing  out  digits  that  weren’t  supposed  to  be  crossed  out). 

In  a second  experiment,  Jones  equated  the  number  of  digits  to  be  represented 
in  an  instruction.  The  positive  instiuction  was  cross  out  the  digits  3,  4,  7,8.” 
The  negative  instruction  was  to  “cross  out  all  members  except  1, 2,  5,  6.”  Again, 
the  negative  instruction  took  much  longer  to  execute  (by  100  sec)  and  resulted 
in  more  false  positive  errors.  Thus,  executing  a negative  instruction,  even  in  a 
very  different  kind  of  task,  takes  significantly  longer  than  executing  the  equiva- 
lent affirmative  one.  This  is  consistent  with  the  hypothesis  that  mismatches 
between  the  internal  representation  of  an  instruction  and  the  representation  of 
some  second  source  of  information,  will  lead  to  longer  latencies. 

The  preceding  analysis  makes  it  clear  that  certain  single  words  are  internall/ 
represented  by  two  components -an  affirmative  core  as  well  as  a negative 
component.  While  wc  cannot  specify  a priori  whether  or  not  a word  is  internall 
represented  as  a negative,  our  model  does  provide  a procedure  to  discriminate 
negative  lexical  items  from  affirmative  items.  If  the  verification  latencies  for  a 
suspect  word  are  shorter  for  false  than  for  true,  then  we  infer  that  word  is  being 
represented  and  processed  as  a negative  in  that  situation  Thus  the  results  cited 
above  show  that  forget,  different,  unrelated,  conflict,  and  except  are  represented 
as  negatives.  The  same  kind  of  analysis  can  also  exonerate  suspect  words.  For 
example,  small  is  not  processed  as  a negation  of  lar^e,'  it  is  represented  as  an 
affirmative  (Carpenter  & Just,  1972;  Just  & Carpenter,  1971). 


Other  Instructional  Examples 

Comprehending  an  instruction  can  be  a major  source  of  difficulty  in  perform- 
ing an  everyday  task.  We  sometimes  encounter  complex  instructions  where 
there  is  no  conceivable  pupose  for  their  complexity.  The  following  notice  from 
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the  Internal  Revenue  Service  (Form  4918)  provides  an  example  (italics  added): 

ANY  QUESTIONS  ABOUT  THIS  NOTICE? 

In  the  event  ws  failed  to  give  you  credit  for  a Federal  tax  deposit  or  any  other 
payment  you  made,  please  accept  our  apology  and  be  guided  by  the  following: 

1.  If  the  payment  not  credited  was  made  within  the  Inst  four  weeks  or  :o,  we  will 
credit  it  soon.  You  need  not  write  us.  Just  subtract  the  payment  we  haven’t  included. 

2.  If  the  payment  not  credited  was  made  more  than  four  weeks  ago,  subtri  ct  it  from 
the  balance  due  . . . 

3.  If  you  have  paid  the  entire  balance  due  within  the  last  four  weeks,  please  disregard 
this  notice. 

Please  send  us  an  explanation  if  'he  balance  is  incorrect  for  any  reasons  other  than 
payments  we  haven't  credited. 

Explanation  of  Penalty  or  Interest  Charges 

Your  return  was  not  Tiled  and  your  lax  wn  not  paid  by  he  due  date.  The  combined 
penalty  is  3 percent  of  the  tax  not  timely  paid  for  each  month  of  part  of  a month  the 
return  was  late,  but  not  more  than  Z 5 percent. 

This  notice  is  not  'he  result  of  an  audit  of  your  return.  When  we  select  a return  for 
audit,  we  notify  the  taxpayer. 

Presumably  the  IRS  is  not  interested  in  testing  our  comprehension  skills. 

In  other  situations,  the  purpose  of  a complex  instruction  is  precisely  to  assess 
comprehension  skills,  as  illustrated  by  the  following  item  (Personnel  Test,  Form 
D„  E.  F.  Wonderlic,  1922): 

Count  each  Z in  this  series  that  is  followed  by  an  F next  to  it  if  the  F is  not  followed 
by  an  S next  to  it.  Tell  how  many  Z’s  you  count. 

ZFZSEYZFSYFZFFSYSZFEZFSFZYFZFY 

This  is  an  extremely  easy  task  if  one  comprehends  the  instruction,  which  could 
have  been  simplified  as  follows:  Count  the  occurrences  of  the  sequence  (/Zl,  /Ff, 
/non-S/)  in  this  series. 

There  are  other  situations  where  the  purpose  of  complex  instructions  is  not 
clear.  The  following  example  is  taken  from  an  aptitude  test  for  prospective 
students  of  management  science  (Graduate  Study  in  Management:  A Guide  for 
Prospective  Students,  1974) 

Directions:  Each  of  the  data  sufficiency  problems  below  consists  of  a question  and  two 
statements,  labeled  (1)  and  (2),  in  which  certain  data  are  given.  You  have  to  decide 
whether  die  data  given  in  the  statements  are  sufficient  for  answering  the  questions.  You 
are  to  blacken  space 

A.  if  statement  (1)  ALONF  is  sufficient,  but  statement  (2)  alone  is  not  sufficient  to 
answer  the  question  asked; 

B.  if  statement  (2)  ALONE  is  sufficient,  but  statement  (1)  alone  is  not  sufficient  to 
answer  the  question  asked; 

C.  if  BOTH  statements  (1)  and  (2)  TOGETHER  ate  sufficient  to  answer  the  question 
asked,  but  NEITHER  statement  ALONE  is  sufficient; 

D if  EACH  statement  ALONE  is  sufficient  to  answer  the  question  asked; 
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This  kind  of  instruction  seems  to  be  testing  both  the  ability  to  comprehend 
instructions,  as  well  as  knowledge  of  geometry  and  logic.  Incorrect  answers 
could  be  caused  by  any  of  these  three  sources.  The  relative  contribution  of 
comprehension  difficulties  can  be  assessed  by  rewriting  these  instructions  in  a 
simplified  format. 

Revised  Directions:  Answer  YES  or  NO  to  each  of  the  following  questions. 

In  A PDR,  can  you  detemine  the  value  of  x if  all  you  know  is  that: 

(1)  PQ  ‘PR?  (Answer  is  NO) 

(2)  y m 40?  (Answer  is  NO) 

(3)  PQ  « PR  and  y = 40?  (Answer  is  YES ) 

These  examples  illustrate  how  successful  performance  in  a test  may  depend  on 
comprehension  skills  in  decoding  the  instruction,  as  well  as  the  content  skills 
that  the  test  ostensibly  taps.  Thus,  both  components  may  enter  into  the  test 
scores  that  can  often  predict  future  academic  performance.  It  may  turn  out  that 
the  predictive  ability  of  the  test  is  partially  due  to  the  comprehension  skills  it 
taps,  rather  than  the  content  skills.  If  the  test  is  being  used  only  for  actuarial 
purposes,  the  relative  loadings  of  the  two  factors  are  irrelevant.  However,  if  the 
testing  is  for  diagnostic  purposes,  then  it  it  necessary  to  assess  the  relative 
contribution  of  comprehension  skills  before  remedial  action  can  be  taken  (cf. 
Hunt,  Frost,  & Lunneborg,  1973).  This  may  prove  to  be  a fruitful  approach  to 
test  construction. 

In  many  situations,  the  primary  purpose  of  an  instruction  is  to  inform,  to  help 
people  perform  correctly  and  efficiently.  For  example,  instructions  on  income- 
tax  forms  or  in  repair  manuals  should  be  constructed  to  minimize  comprehen- 
sion difficulties.  A theory  of  sentence  comprehension  such  as  we  have  outlined 
suggests  the  kinds  of  problems  that  may  arise  in  representing  and  executing 
various  kinds  of  instructions.  The  theoretical  approach  also  suggests  ways  of 
making  everyday  instructions  easier  to  comprehend. 


E.  if  statements  (1)  and  (2)  TOGETHER  are  NOT  sufficient  to  answer 
asked,  and  additional  data  specific  to  the  problem  are  needed. 

P 

1. ample:  In  A PQR,  what  is  the  value  of  x?  a 

(1  )PQ‘PR 

(2)  y - 40  / 

Answer.  C 0 ‘ 


EDUCATION  * L TESTS  OF  VERBAL  COMPREHENSION 
Sentence-Con  pie' ion  Tasks 

Another  domain  in  which  we  can  apply  ou  information  processing  analysis  is 
the  sentence-completion  task,  which  often  appears  in  tests  of  academic  achieve- 
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ment  or  ability.  This  task  involves  choosing  one  of  several  alternatives  that 
best  completes  a sentence  frame.  Consider  the  following  example: 

Beauty  is  only  skin-deep,  but goes  all  the  way  to  the  bone. 

a.  disease  b.  blood  c.  ugliness  d.  fright  e.  liniment 
(Answer:  c) 

Although  performance  in  this  task  depends  to  some  extent  upon  adequate 
vocabulary,  much  of  the  processing  can  be  explained  in  terms  or  the  processes 
described  by  the  comprehension  model. 

Many  of  the  items  in  a sentence-completion  test  have  structures  that  are 
basically  like  the  example  above.  These  items  consist  of  two  parallel  clauses  of 
the  same  syntactic  type,  although  there  may  be  a negative  lurking  in  one  of 
them.  The  misring  item  is  a constituent  of  one  of  the  clauses.  The  connective 
between  the  two  clauses  is  either  affirmative  (e.g.,  and)  or  negative  (e.g.,  but). 
The  polarity  of  the  connective,  as  well  as  the  presence  of  a negative  in  one  of  the 
clauses,  determines  whether  the  missing  item  should  be  an  antonym  or  synonym 
of  the  corresponding  constituent  in  other  c'luse.  In  the  example  above,  the 
negative  connective  but  is  a cue  that  the  answer  is  an  antonym  of  beaut}’ 

A number  of  examples  wiii  give  the  flavor  of  the  kinds  of  sentence  completion 
items  that  involve  negative  connectives,  like  yet,  but,  unlike,  whereas,  and 
although  (taken  from  a booklet,  Prepa~ation  for  college  board  examinations  by 
Henry  Regnery  Co.,  1972,  pp.  107-126): 

Unlike  his  cousin,  the  artist,  who  was  colorful,  whimsical,  and  erratic,  the  teacher  was 
prosaic, , and  consistent. 

a.  infallible  b.  commonplace  c.  objective  d.  disorganized  e.  subtle 

Though  he  was  romantic  and  sen«i  lal  in  his  outlook,  his  life  was  one  of . 

a.  profligacy  b.  naivete  c.  a jsterity  d.  virtuousity  e.  maturity 

These  conditions  are  not t’.e  nature  of  women  but  have  grown  up  in  spite  of  it. 

a.  intrinsic  to  b.  pararr.iunt  in  c.  compelling  in  d.  immutable  ir.  e.  extrinsic  in 

Early  in  the  I9th  century,  in  the  South,  it  had  become  the  fashion  to  raise  only  one 
staple  crop,  whereas  in  the  North  the  crops  were . 

a.  diversified  b.  unstable  c.  fallow  d.  uniform  e.  wild 

In  this  game  he  was  an  amateur,  not  an  expert,  and  thus,  for  the  first  time,  became  a(n) 

instead  of  a man  of  action. 

a.  connoisseur  b.  spectator  c.  lawyer  d.  pragmatist  e.  authority 

Linguistic  analyses  of  the  clausal  conjunction  but  show  that  it  involves  incon- 
gruence between  the  two  clauses.  For  example,  but  may  be  used  it  there  is  a 
lextcai  contrast  between  the  two  clauses,  e.g.,  Mary  likes  school  but  John  hates 
it.  A second  use  of  but  involves  a contrast  between  what  is  stated  and  what  the 
speaker  believes  to  be  the  usual  connection  between  the  two  clauses,  e.g..  Bill  is 
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(TS  b^elh°rSt  °:Q?QCkJl  a veterina™  he  doesn't  like  dogs 
1/L  l ’ ’ G!Tan>  1%9:  Dik'  1968>-  Other  connectives  like  instead 

Cans ef  ’ ^ 0/’  3nd  yet  Conj°in  similar  ^"ds  of  contrasting 

/t07Tt  T;i'meun,al  inVeStigati°n  °f  the  c°mpletion  task,  Osgood  and  Richards 
( 973)  asked  subjects  to  complete  sentences  like  X is  beautiful  _ dumb  or  X is 

0 — Sl°W ' Wlth  ™d  or  but ■ The  adjectives  in  the  sentence  either  had  the 
same  or  opposite  affective  polarity,  which  was  determined  a priori  with  the 
emant.c  deferential.  As  the  linguistic  analysis  would  predict,  incongruence 
between  the  two  lexical  items  was  a more  favorable  environment  foTbut 
wherea8  congruence  was  a more  favorable  environment  for  and. 

The  comparison  model  suggests  the  processes  that  might  underlie  performance 
m this  completion  task.  First  the  sentence  is  parsed  into  two  parcel  clauses 
en  the  constituents  of  the  clauses,  including  the  coordinate  conjunction  and 
polarity  markers  are  checked  serially  for  their  polarity.  The  number  of  negatives 

is  a synonym  or  antonym  °f  the  pr°v,ded 

This  model  of  processing  can  be  tested  with  data  collected  during  the  sentence 
completion  task.  Hoosain  (1973)  measured  latencies  while  people  completed 
fences  ike  those  in  the  Osgood  and  Richard’s  task,  and  he  aho  varied  the 
number  of  exphc.t  negatives  in  the  sentence.  For  example,  a sentence  could 
involve  adjectives  of  similar  affective  polarity  (e.g.,  Eve  was  mild  nice ) or 

eTr  nV nevT  ?77  ^ ^ ™ tr°ubled -h°PPxU "d  co^  contain 
no  negatives  at  all,  one  negative  (e.g.,  Eve  was  mild not  nice)  or  two 

negatives  (e.g.,  Eve  was  not  mild  not  nice). 

As  might  be  expected,  latencies  increased  as  the  number  of  negatives  in  the 

:~K7d  r°m  Z6r°  ,0  °"e  «°  tWO’  Furth™’  ^encieLere  short 

not  affectldL  Je,h,Ve?  C°ngrUent  in  affeCtiVe  Polarity-  difference  was 
affected  by  other  factors,  such  as  the  number  of  extra  operations  caused  by 

procts  LT  t ll  h"3'1  TtlVeS'  ThC  reSU‘,S  CO,nPle,ely  consistent  with  a 

"j‘T/eT  1 rsma,ch  be,w“n  ,he  1 

anrumative  frame  with  which  i,  is  compared.  Such  mismatches  cause  extra 

tttT’  rat,'°nS  are  addi,ive'  Thus-  the  basic  processes  involved  ,n 

this  sentence-completion  task  are  quite  similar  to  the  ones  imolved  in  compre- 
hend,ng  and  verifying  sentences,  although  the  control  structures  may  be  differ- 
for  the  two  tasks.  Processes  in  both  tasks  involve  serially  examining  the 
constituents  o . representations,  encountering  mismatches,  and  consequently 
^forming  additional  mental  operations.  This  analysis  has  attempted  to  show 
that  performance  on  a common  item  from  a test  of  verbal  skills  can  be  an- 
talks  m “"denying  mental  operations  found  in  other  comprehension 
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Potential  Applications:  The  Reading  Comprehension  Test 

In  this  section,  we  will  try  to  outlin  1 the  kinds  of  representations,  retrieval  and 
comparison  operations  in  another  task  involving  verbal  comprehension:  the 
reading  comprehension  test.  This  task  is  much,  more  complex  than  the  other 
ones  that  we  have  analyzed.  The  reading  comprehension  test  involves  reading  a 
passage,  usually  150-500  words  long,  and  then  answering  8 to  12  multiple-choice 
questions  about  the  passage.  The  instructions  ire  to  first  skim  the  passage  then 
read  the  questions  and  return  to  the  passage  for  information  when  it  is  neces- 
sary. The  time  allotted  to  read  th:  passage  and  answer  all  the  questions  is  usually 
5 to  15  minutes.  We  studied  this  task  by  having  three  subjects  express  their 
strategies  and  thoughts  aloud  v/hilc  they  performed  several  reading  comprehen- 
sion tests.  Thus,  this  section  represents  a potential  extension  of  the  general 
approach,  rather  than  an  emp  rically  confirmed  model. 

During  the  initial  reading  of  the  paragraph,  the  theme  or  central  proposition  of 
the  passage  it  generally  extiacted  and  represented.  Our  subjects  indicated  they 
had  represented  the  thematic  information  by  their  ability  to  answer  the  ques- 
tions about  the  theme  without  looking  back  at  the  passage.  In  other  studies,  it 
has  been  shown  that  if  subjects  are  kept  from  knowing  the  theme,  both 
comprehension  and  memory  for  the  passage  suffers  (Bransford  A Johnson,  1973, 
Dooling  A Lachman,  1971).  Also,  when  recognition  memory  for  individual 
sentences  in  a passage  is  tected,  there  is  a much  higher  false-alarm  rate  for 
distractor  sentences  that  contain  the  theme  (Singer  & Rosenberg,  1973).  (See 
also  Hyman,  Chapter  8,  this  volume).  These  results  indicate  that  the  thematic 
information  plays  a central  role  in  the  representation  of  the  passage. 

The  initial  representation  of  the  passage  also  contairs  information  about 
higher-order  relations  that  exist  between  the  thematic  proposition  and  subsidiary 
propositions.  These  are  relations  such  as  causality  and  temporal  order  of  events, 
which  are  sometimes  cued  by  words  like  because,  consequently,  after,  before, 
and  so  on.  The  represent stion  of  individual  propositions  linked  by  higher-order 
relations  can  be  accommodated  by  a number  of  representational  schemes  (cf. 
Crothers,  1972,  Kintsch,  1972;  Rumelhart,  Lindsay  A Norman,  1972,  Schank, 
1972).  Subjects  often  su  ed  the  occurrence  of  such  higher-order  relations 
without  storing  the  content  of  the  subsidiary  proposition.  For  example,  after  the 
initial  reading  a subject  might  remember  that  the  consequences  of  a certain  event 
were  listed,  but  be  unable  to  recall  th*  specific  instances. 

The  third  kind  of  information  extracud  during  the  initial  reading  is  a represen- 
tation of  the  information  development  in  the  passage.  Subjects  seemed  to  More 
information  that  could  act  as  a pointer  to  a particu'ar  part  of  the  passage  when  a 
question  required  specific  information.  In  a .wise,  the  printed  passage  was  used 
as  an  external  memory,  and  the  internal  repiesentation  served  as  an  indexing 
system  for  that  external  memory.  Our  subjects  often  knew  where  to  look  in  the 
passage  for  specific  information.  For  example,  if  a question  alluded  to  a specific 
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fact,  the  subject  would  say  “I  remember  something  about  that  just  before  the 
end”  or  “ — that  appeared  in  the  middle.”  Then,  he  would  proceed  to  search 
through  the  appropriate  part  of  the  passage.  Of  course,  some  of  these  strategies 
are  probably  due  to  the  task  conditions,  which  emphasize  speed,  but  permit 
subjects  to  look  back.  In  summary,  it  appears  that  after  the  initial  reading,  our 
subjects  had  a record  of  the  location  of  certain  information  in  the  passage,  ta 
well  as  the  main  theme  and  a list  of  some  relations  between  the  theme  and 
subsidiary  propositions. 

Our  approach  to  the  reading  comprehension  test  is  to  focus  on  representing, 
indexing,  retrieving,  and  comparing  information.  Although  our  approach  de- 
emphasizet  the  obvious  factor  of  vocabulary,  ix.  previous  knowledge  of  the 
words  in  the  passage,  experimental  evidence  suggests  that  such  de-emphasis  may 
be  justified.  Tuinman  and  Brady  (1973)  showed  that  thorough  pretraining  on 
vocabulary  items  from  the  reading  passage  did  not  raise  the  comprehension 
scores  of  children  in  grades  four  to  six.  While  some  minimal  knowledge  of  the 
vocabulary  is  dearly  a necessary  condition  for  successful  performance,  it  is  not 
sufficient  to  improve  performance  beyond  a given  level.  This  study  suggests  that 
the  important  skill  in  rearing  comprehension  is  the  ability  to  represent  and 
manipulate  the  information  presented  in  the  passage  aid  questions. 

The  advantage  of  analyzing  the  reading  comprehension  task  in  terms  of 
information  processing  theory  is  that  it  de Tines  the  relevant  empirical  questions 
to  be  answered.  One  process  to  be  explored  is  the  mechanism  that  abstracts  the 
theme.  For  example,  it  is  possible  that  the  thematic  proposition  is  the  one  that 
occurs  most  frequently  lu  the  passage,  as  suggested  by  the  simulation  model  of 
Rosenberg  (I9T4).  Another  issue  to  be  explored  is  the  jredse  representation  of 
the  indexing  system  that  records  where  facts  were  mentioned  in  the  paragraph. 
Yet  another  is  the  determinination  of  how  particular  questions  t p into  this 
index.  This  analysis  provides  an  outline  of  how  a complex  task  like  the  reading 
comprehension  test  can  be  approached  in  terms  of  the  basic  components  of  the 
comprehension  process—  Ihe  representation,  retrieval,  and  comparison  of  infor- 
mation. 


CONCLUSIONS 

What  Makes  a Sentence  Hard  to  Praeess? 

The  comprehension  model  makes  live  claim  that  a sentence  is  difficult  to  process 
when  it  doesn’t  match  the  representation  of  some  second  source  of  information 
Thus,  the  critical  variable  that  determines  processing  difficulty  is  the  number  of 
matches  or  mismatches  between  two  representations,  the  critical  factor  is  not 
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affirmation  or  negation,  per  se.  According  to  the  model,  negatives  are  harder  to 
process  only  when  they  mismatch  with  the  affirmative  representation  of  other 
information.  For  example,  pictures  are  generally  reprt*ented  affirmatively , so 
sentences  that  ref  : .o  pictures  are  generally  easie'  to  process  if  they  art 
affirmative.  Similaily,  the  information  stored  in  seruriJc  memory  is  usually 
stored  in  some  affirmative  form,  so  the  comprehension  of  the  sentence  referring 
to  semantic  memory  >s  usually  easier  if  the  sentence  is  affirmative.  However,  the 
implication  o,"  the  m<iJe!  is  clear -negatives  are  not  necessarily  harder  to  process 
than  affirmatives;  mismatches,  pithcr  than  negation  per  se,  determine  »he  ease  of 
comprehending  linguistic  information. 

When  Negatives  Are  Easier 

The  model  predicts  that  a negative  sentence  should  be  easier  than  an  affirmative 
if  the  Information  from  the  other  source  were  represented  negatively.  Then,  the 
negative  sentence  would  match  the  representation  of  the  second  source  of 
information  and  the  comparison  would  be  faster.  By  contrast,  the  affirmative 
sentence  would  mismatch  and  processing  would  take  longer.  In  fact,  our  analysis 
of  an  unusual  reasoning  task  supports  this  prediction.  Johnson-Laird  and  Tridgell 
(1972)  presented  sublets  with  a disjunctive  premise  (p  v (?)  and  a probe  (~q), 
and  asked  the  subjects  to  draw  a conclusion  (p).  The  premise  contained  two 
clauses  like  Either  John  is  intelligent  or  John  is  rich.  The  probe  sentence  always 
had  a different  truth  value  than  one  of  the  two  clauses  in  the  premise,  for 
example,  John  is  not  rich , so  the  conclusion  was  the  remaining  clause,  is.  John 
is  intelligent. 

The  task  required  that  the  subject  ask  himself  whether  a clause  in  the  premise 
conflicts  with  the  probe.  This  self  instruction  may  have  caused  the  same  kind  of 
internal  representation  that  we  postulated  for  instr  lions  involving  conflict, 
different,  and  distqree.  The  relevant  clause  in  the  premise  may  have  been 
encoded  into  negative  instruction  and  then  compared  to  the  probe.  For  the 
example  Either  John  is  intelligent  or  John  is  rich,  the  second  clause  may  have 
been  coded  into  the  instruction  (NEC.  ( X )).  so  that  it  resulted  in  the  represen- 
tation (NEC.  (RICH.  JOHN))'  This  was  then  compared  to  the  probe,  John  is  not 
rich,  represented  (NEC,.  (RICH,  JOHN)).  The  model  predicts  that  such  a 
negative  probe  would  be  processed  faster  than  an  affirmative  probe  like  John  is 
poor,  represemed  as  (AFF.  (POOR.  JOHN)  ).  As  predicted,  the  response  latency 
to  negative  probes  was  shorter  (by  1.6  sec)  than  the  latency  for  the  affirmative 
probe.  The  model  correctly  makes  the  nonintuitive  prediction  that  the  negative 
probes  are  processed  faster  in  this  situation.  This  supports  the  argument  that 
mismatches,  rather  than  negatives  per  se.  consume  processing  time.  Thus,  it  is 
the  relationship  between  two  representations  that  determines  the  speed  of 
comparison  processes. 
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Processing  Instructions  in  an  Everyday  Situation 

It  was  recently  shown  that  in  a highly  realistic  situation,  people  remember 
affirmative  instructions  much  better  than  their  negative  counterparts.  The  situa- 
tion was  an  airport,  where  eighty  waiting  airline  pasaengers  were  asked  so  read  or 
listen  to  a 200  word  passage  describing  in-flight  emergency  procedures,  based  on 
actual  airline  protocol  (File  A Jew,  1973).  The  individual  instructions  were 
either  affirmative  (eg..  E)  tmguish  cigarettes.  Remove  shoe .)  or  the  correspond- 
ing negative  ses  (eg.,  Do  not  leave  cigarettes  lighted.  Do  not  keep  shoes  on. ).  The 
results  showed  that  the  passengers  recalled  about  20%  more  information  from 
.iflirmative  Listructions  than  from  negative  instructions.  The  better  recall  of 
affirmative  instructions  may  i ave  been  the  consequence  of  fewer  mental  opera- 
tions during  comprehension.  Because  the  affirmatives  are  comprehended  faster, 
subjects  may  have  had  more  time  to  transfer  information  into  long-term 
memory.  Thi  Significance  of  this  study  is  clear  laboratory -based  theories  of 
comprehension  do  apply  to  real  situations  involving  cnticaJly  important  instruc- 
tions. 

We  have  examined  several  tasks  that  involve  verbal  comprehension  in  instruc- 
tional rettings.  The  focus  has  been  how  the  information  in  a sentence  is 
represented  and  manipulated.  We  have  pro  io*ed  a general  mode!  to  account  for 
comprehension  in  a variety  of  filiations,  such  as  verifying  or  completing  sen- 
tences and  executing  instructions.  The  kinds  of  tasks  surveyed  and  the  analysis 
have  both  practical  an,',  theoretical  importance.  On  the  practical  ride,  this  tind 
of  analysis  may  help  to  localize  the  difficulties  that  an  individual  has  n verbal 
comprehension.  Moreover,  this  approach  could  lead  to  a set  of  rules  for  writing 
easily  comprehensible  instructions.  The  analysis  of  these  tasks  in  terms  of 
fundamental  processes  helps  to  unravel  the  Gordian  Nor  of  verbal  comprehen- 
sion. 
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Understanding  Comolex 
Task  instructions 
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In  the  ninth  C.imepe  Symposium  on  C opinion.  we  described  the  processes  used 
by  human  subjects  to  un  demand  the  instructions  in  a problem-solving  tail 
(llayes  A S»itv«.  1*1741  The  aim  of  tha  chapter  is  to  build  upon  the  theory  of 
the  understanding  process  (and  the  UNDfcRSTAND  computer  program)  devel- 
oped in  that  previous  paper,  and  to  draw  out  the  implications  o»  the  theory  for 
educational  processes  and  pracoces  The  discussion  will  be  organized  under  three 
nvn  topics  the  UNDERSTAND  theory  and  some  directions  m whi.h  it  needs 
to  hr  e*  tended  the  rok  of  prior  knowledge  si  understanding,  and  the  pedagogi- 
cal implications  of  a theory  of  underttirufcng  It  will  focus  upon  a strategy  for 
analyzing  tasks  with  a view  u>  dncovrnng  what  is  involved  in  understanding 
them 

Much  concern  is  expressed,  from  time  to  time,  aboui  “functional  ilhteiacy’  in 
our  uk lets  A functional  ithirraie  is  someone  iho  cannot  perform  the  reading 
tasks  with  which  ha  job  and  hu  daily  life  confront  fum  The  illiterate  cannot 
read  and  understand  the  directions  on  the  medacwx  bottle,  the  do-tt-s ourself  lot. 
or  the  soup  can  He  cannot  understand  the  fine  wait  on  the  traffic  ticket  or  read 
has  personal  mail  Ik  cannot  read  the  instruction  manual  for  a piece  of  equip- 
ment used  hi  h»s  job  Although  only  a modesi  amount  of  research  ha*  been  done 
on  the  causes  of  funcuonai  illiteracy  the  diU  that  are  available  suggest  two 
generaluatasns  about  it 

I Functtonal  tniteracy  is  primarily  ass  understanding  problem,  rather  than  a 
readme  problem  That  is  to  say . most  functional  Jb rentes  in  our  sockis  possess 
the  bciiv  decoding  uilh  for  revhng  the  printed  word  What  they  lack  is 
adequate  vocabulary  . knowledge  ar.d  skill  to  nrrrpret  the  communications  that 
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are  directed  to  them,  regardless  of  whether  those  communications  are  written  or 
spoken.  This  has  been  demonstrated  very  convincingly  by  Sticht  0972),  who 
has  shown  that  if  a person  can  understand  an  oral  communication,  he  can  almost 
always  understand  the  same  communication  presented  to  him  in  writing,  and 
vice  versa.  Hence  any  major  attack  on  adult  functional  illiteracy  must  be 
directed  at  improving  language  understanding  skills,  and  not  simply  at  improving 
the  decoding  skills  of  beginning  reading. 

2.  Much  of  the  difficult  language  that  an  individual  encounters  in  daily  life  is 
found  in  the  instructions  he  receives  about  what  to  do  or  how  to  do  it. 
Therefore,  any  major  attack  on  adult  functional  illiteracy  needs  to  place  special 
emphasis  on  the  improvement  of  the  skills  of  understanding  instructions. 

If  understanding  instructions  is  an  important  component  of  functional  literacy 
for  adults  in  their  everyday  and  workaday  lives,  it  is  an  even  more  crucial  skill 
for  children,  adolescents,  and  young  adults  who  spend  their  days  in  school. 
Schools  (including  universities)  are  the  most  persistently  evaluating  institutions 
in  our  society.  They  are  continually  testing  their  students,  and  the  heart  of 
testing  is  determining  whether  someone  ran  carry  out  successfully  a set  of  task 
instructions.  Someone  who  cannot  understand  instructions  cannot  pass  tests. 
Hence,  understanding  instructions  is  one  of  the  principal  skills  we  test  in  the 
schools. 

But  the  understanding  of  instructions  enters  school  tasks  by  still  another 
route.  Schools  teach,  or  attempt  to  teach,  their  students  to  solve  problems  in  a 
wide  range  of  domains.  The  first  step  in  solving  a problem  is  to  understand 
it—  that  is,  to  make  a meaningful  description  of  the  problem  situation  (This  view, 
that  to  understand  is  to  make  a description  of  the  situation,  is  elaborated  in  the 
next  section.)  But  a description  of  a problem  situation  is  nothing  more  nor  less 
than  an  instruction  that  defines  a task  and  requests  that  it  be  performed.  Hence, 
a significant  component  of  problem  solving  skill  in  any  domain  is  the  skill  of 
understanding  the  instructions  for  problems  in  that  domain. 


THE  NATURE  OF  UNDERSTANDING 

On.1  understands  task  instructions  if  he  can  program  himself  to  attempt  the 
task.  This  does  not  necessarily  mean  that  he  can  perform  the  task  successfully. 
Most  people  can  readily  understand  the  Four  Color  Theorem:  Any  plane  map 
can  be  colored  with  not  more  than  four  colors  in  such  a way  that  no  territories 
with  a common  border  have  the  same  color.  If  a person  knows  what  a proof  is, 
he  can  p<o?ram  himself  to  respond  to  the  instruction:  “Prove  the  Four  Color 
Theorem.”  His  chances  of  success,  however,  are  miniscule;  in  spite  of  the  efforts 
of  numerous  first-rate  mathematicians,  no  one  has  suco  eded  in  findings  proof 
of  the  Four  Color  Theorem-or,  for  that  matter,  a disproof. 
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(V.iat,  more  precisely,  does  it  mean  for  a person  to  understand  a task  that  he 
niay  or  may  not  be  able  to  perform?  At  a minimum,  it  means  that  he  can  test 
"he  adequacy  cf  a purported  performance  or  solution.  Bn  ordinarily,  we  expect 
something  moie  than  this  minimum  of  understanding.  A person  approaches  a 
new  set  of  task  instructions  equipped  with  certain  somewhat  general  problem- 
solving capabilities.  Somehow,  he  enlists  these  problem-solving  capabilities  in  the 
task  which  the  new  instructions  present  to  him.  The  process  of  understanding 
the  instructions  is  precisely  this  enlistment  process. 


A Specific  Theory  of  Understanding 

In  our  previous  paper,  we  reported  on  the  UNDERSTAND  program,  a computer 
simulation  of  the  u .demanding  process.  The  UNDERSTAND  program  embodies 
the  following  theory  of  the  process.  Before  any  specific  problem  can  be 
attacked,  it  must  be  described  in  terms  of  a problem  space— a space  of  situations 
that  may  be  visited  in  a search  for  a solution -and  a set  of  operators  or  ‘ moves” 
for  changing  one  situation  into  another  in  the  course  of  the  search. 

Consider  the  problem  of  choosing  a move  in  the  game  of  tic-tac-toe.  The 
problem  space  here  might  be  the  space  of  possible  game  situations,  of  arrange- 
ments of  crosses  and  circles  on  the  3X3  array.  The  move  operator  adds  a new 
cross  u.  urcle  ic  the  array.  Given  a way  of  representing  the  various  possible 
game  situations,  and  a way  of  moving  from  one  to  another  of  these,  a problem- 
solving program  like  the  General  Problem  Solver,  GPS  (Ernst  and  Newell,  1967) 
could  go  to  work  on  the  task  of  finding  a good  move  in  tic-tac-toe.  (Actually, 
GPS  would  need  a few  other  “givens”  beyond  those  mentioned  above,  but  the 
problem  space  and  the  move  operator  are  the  central  requisites  for  it.)  To  ask 
how  the  UNDERSTAND  program  goes  about  undemanding  written  problem 
instructions  is  to  ask  how  it  transforms  those  instructions  into  a problem  space  and 
a move  operator . As  we  showed  in  our  previous  paper,  tins  is  accomplished  by  the 
UNDERSTAND  program  and  by  human  subiects  as  well,  along  the  following 
lines: 

1.  The  input  instructions  are  analyzed  syntact'-ally  by  means  of  a parsing 
program. 

2.  A search  is  made  through  the  analyzed  text  for  sets  of  “objects”  that  need 
to  be  represented  in  the  problem  space,  and  sets  of  relations  among  these 
objects. 

3.  A structure  is  created  that  permits  objects  and  their  relations  (and  thereby 
situations)  to  b-  represented.  The  representation  is  used  to  store  information 
extracted  from  the  problem  instruction:;  about  the  initial  and  goal  situations. 

4.  A search  is  made  through  the  analyzed  text  for  operators  that  change  the 
relations  amors  objects. 
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*>  Semantic  memory  is  searched  for  possible  semantic  interpretations  of  the 
change  operators.  Associated  with  t.'.e  interpretations  in  semantic  memory  are 
programs  for  actually  making  the  corresponding  changes  in  memory  structures. 

6.  The  representation  created  in  step  (3)  is  described,  to  provide  the  informa- 
tion required  by  the  interpreted  change  operators.  These  operators  have  the  task 
of  actually  carrying  out  the  moves  from  one  situation  to  another  by  making  the 
appropriate  char:,'***',  in  the  represented  situation. 

Let  us  see  what  e,  n o means  in  terms  of  a simple,  concrete  example  Table 
I shows  the  instructions  ! >r  a “Monster  Problem.”  The  first  step  in  understand- 
ing this  problem  is  to  d reover  the  surface  structures  of  the  component  sen- 
tences, to  interpret  pronouns  and  other  backward  references,  and  to  carry  out 
the  other  syntactic  tasks  of  this  kind  that  would  be  performed  by  a parsing 
system. 

The  second  sttp  is  to  identify,  in  the  syntactically  analyzed  sentences,  the 
objects  involved  in  the  problem,  in  this  case  monsters  and  globes.  Similarly,  a 
number  of  relations  can  be  identified:  monsters  are  ordered  by  size,  as  are 
globes;  t,iere  are  relations  of  “holding”  and  of  “teleporting"  or  “giving.” 

In  the  third  step,  a way  must  be  found  for  representing  situations  in  which 
particular  monsters  hold  particular  globes.  In  a memory  capable  of  storing  list 
structures,  such  a representation  is  readily  .instructed  as  follows  A node  is 
created  m memory  labeled  “Current-Situation."  An  attribute.  “Monster-List."  is 
associated  with  that  node,  having  as  its  value  a list  of  the  monsters:  “Monster. I ," 
Monster. 2,  and  “Monster.3.”  With  each  monster  node  is  associated  an  attrib- 
ute. “Size.”  having  as  value  “Small,”  “Medium,”  or  “Urge.”  A second  attribute 
associated  with  each  monster  is  “Globcs-Held."  The  value  of  “Globes-Held"  is 
again  a list  of  the  names  of  the  globes  currently  held  by  the  monster  in  question. 


TABLE  t 

A Monster  Problem 

Three  live-handed  extraterrestrial  monsters  were  holding  three  crystal  globes.  Because  of 
the  quantum-mechanical  peculiarities  of  their  neighborhood,  both  monsters  and  globes 
come  in  exactly  three  sizes  with  no  others  permitted  small,  medium,  and  large  The 
medium-sized  monster  was  holding  the  small  globe;  the  small  monster  was  holdmg  the  Urge 
globe,  and  the  largest  monster  was  holding  the  medium-sized  globe.  Since  this  situation 
offended  their  keenly  developed  ;>nv  of  symmetry,  they  proceeded  to  teleport  globes  from 
one  to  another  so  that  each  monster  wo  tld  have  a globe  proportionate  to  his  own  size. 

Monster  etiquette  complicated  the  soi  ion  of  the  problem  since  it  requires 

1 that  only  one  globe  can  be  transmuted  at  a time, 

2 that  if  a monster  is  holding  two  globes,  he  may  transmit  only  the  larger  of  the  two 
and 

?.  that  a globe  must  not  be  transmitted  to  a monster  who  is  holding  a larger  globe. 

By  what  sequence  of  teleportations  could  the  monsters  have  solved  this  problem? 
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TABLE  2 

A R«pr«Mnm,on  of  th.  Initial  Situation  for  the  Montttr  Problem 
Curreni  Siiuaiion 
Monslei  Ltsl: 

Mens  let.  I 
Size  Medium 
Globe  lisr 
Globe.  I 
Size:  Small 

Monrier.  2 
Size:  Small 
Globe  list 
Globe.  2 
Size  Large 

Monster  3 
Size:  Large 
Globe  list: 

Globe.  3 
Size  Medium 


Hach  globe  is  described  by  its  size  ,n  the  same  way  as  the  monsters  are.  The 
entire  representation  for  the  initial  problem  situation  is  shown  in  Table  2. 

As  rourth  and  fifth  steps,  the  move  operator  must  be  identified,  and  it  must 
receive  a semantic  interpretation.  In  this  problem,  the  move  operator  may  be 
represented  as:  7 

GIV  KS(Globe.X,  Monster. Y,  Monster^). 

which  may  be  read  as.  “Monster  T gives  Globe  X to  Monster  Z " Notice  that  tl  is 
operator  involves  one  object  belonging  to  one  class  (globes),  and  two  objects 
e ongmg  to  another  class  (monsters).  This  characteristic  allows  GIVES  to  be 
interpreted  as  an  instance  of  the  MOVE  operato:  which  is  already  stored  in 
ong-term  memory,  along  with  programs  that,  when  provided  with  appropriate 
information  about  the  representation,  can  actually  carry  out  the  move 

MOVE  (A.\,B.\,3.2) 

The  operation  of  moving  A. I from  B I to  B 2 consists  in  deleting  the  relation  in 
the  represen tahon  of  the  situation  that  holds  between  A.\  and  B I , and  then 
es  abhshing  the  corresponding  relation  between  A I and  B.2.  How  this  is  to  be 

done  depends  on  the  way  in  which  the  relation  between  the  As  and  Jhe  Bs  is 
actually  represented  in  memory. 

The  sixth  step,  then,  ,s  to  provide  the  MOVE  operator  with  the  information  it 
e.ds  in  order  to  act  upon  the  representation  of  the  situation.  This  is  accom- 
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plished  by  constructing  a description  of  the  representation  (a  formalized  version 
of  the  English-language  description  we  provided  a few  paragraphs  earlier).  Using 
this  description  the  MOVE  operator  can  now  interpret  “Monster  Y gives  Globe 
X to  Monster  Z"  as  meaning  that  Globe  JC  is  to  be  deleted  from  the  value  list  of 
the  attribute  Globes  Held  of  Monster.  Y,  and  is  to  be  added  to  the  value  lisi  of 
the  attribute  Globes  Held  of  Monster.?.  If  the  situation  had  been  represented 
differently -by  storing  the  list  of  globes,  say,  and  associating  with  each  the  name 
of  the  monster  holding  it— then  the  actual  operation  of  making  a move  would 
have  been  altered  accordingly.  In  tliat  case,  the  move  would  have  been  made  by 
deleting  the  donating  monster’s  association  with  the  transferred  globe,  and 
associating  the  receiving  monster  with  that  globe.  In  each  instance  the  processes 
of  the  MOVE  operator  would  be  controlled  by  the  description  of  the  particular 
representation  in  use.1 

In  summary,  the  UNDERSTAND  theory  asserts  that  a person  understands  a 
problem  like  the  Monster  Problem  if  he  has  constructed  a representation  for 
problem  situations  (poss'ble  distributions  of  globes  among  monsters),  and  if  he 
can  consider  changes  in  the  problem  situation  corresponding  to  legal  meves  and 
actually  carry  out  those  changes  on  the  stored  representation.  The  reason  for 
regarding  this  as  undemanding  is  that,  equipped  with  this  interpretation  of  a 
problem,  a problem  solver  can  attack  it  by  applying  means-ends  analysis  to  it.  He 
can  determine  in  what  respects  the  goal  situation  (all  monsters  holding  globes 
proportionate  to  their  own  sizes)  differs  from  the  initial  situation,  and  how  these 
differences  might  be  removed  by  applying  the  MOVE  operator.  We  repeat  our 
earlier  caution:  to  say  that  the  problem  has  been  understood  does  not  imply  that 
it  can  be  solved  either  easily,  or  at  all. 

Oral  versus  Written  Instruction 

All  of  the  steps  in  the  understanding  process  that  have  been  described  are 
essential,  regardless  of  whether  the  task  instructions  are  received  orally  or  in 
writing.  We  cited  earlier  some  empirical  evidence  (Sticht,  1972)  that  for  adults 
the  difficult  of  understanding  instructions  is  largely  independent  ol  instruction 
modality.  However,  there  are  some  differences  between  oral  and  written  com- 
munication that  can  affect  understanding  substantially. 

The  most  important  distinction  is  that  oral  instructions  may  exceed  the 
capacity  limits  of  short-term  memory.  For  example,  it  would  bt  extremely 
difficult  to  understand  the  Monster  Problem  in  Table  I -particul;  rly  the  legal 
moves-on  a single  oral  presentation.  Subjects  who  are  given  the  problem  in 


1 Details  of  how  the  UNDERSTAND  program  carries  out  these  steps  ate  provided  in  our 
previous  paper  (llayes  & Simon,  1974).  In  that  paper,  we  also  discuss  how  the  rule 
conditions  (the  limitations  upon  who  may  give  globes  to  whom)  are  interpreted.  The 
method  is  very  similar  to  the  method  of  interpreting  the  move  operator. 
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writing  handle  the  STM  capacity  limitation  limply  by  reread*.1?"  tie  more 
complex  sentences  as  often  as  they  need  to.  The  sentences  stating  the  lit  > Stations 
on  transfers  of  globes  and  the  sentence  describing  the  initial  situation  are  almost 
always  read  a number  of  times  by  our  subjects.  The  other  sentences  are  usually 
read— and  apparently  understood— in  a single  pass.  The  UNDERSTAND  program 
makes  a number  of  successive  passes  over  the  problem  text,  and  is  allowed  as 
much  short-term  memory  as  it  requires.  Hence,  it  approximates  more  closely  the 
conditions  of  reading  then  of  listening. 

Another  difference  between  oral  and  written  instructions  is  that  the  former 
contain  stress  and  intonations!  cues,  while  the  latter  contain  punctuation. 
Although  these  differences  cou.’d  have  important  effects  upon  understandability , 
we  believe  that  in  fact,  the  effects  are  usually  small.  The  UNDERSTAND 
program,  as  it  now  stands,  nukes  use  of  punctuation  as  a cue  to  meaning,  but 
not  italics,  nor  stress  and  intonat'onal  cues. 

Environmental  Feedback 

Knowledge  of  results  is  crucial  to  ar  ’taming  process,  and  the  underttandin; 
process  should  be  i-o  exception.  What  knowledge  of  results  does  the  UNDER- 
STAND program  obtain  in  the  course  of  acquiring  an  understanding  of  the 
Monster  Problem*’  How  does  the  program  know  when  it  has  understood*’ 

The  program  can  detect  whether  it  has  succeeded  in  formulating  some  prob- 
lem, but  it  cannot  guarantee  that  the  formulation  is  the  one  intended  by  the 
problem  statement.  The  program’s  test  that  it  has  performed  its  task  amounts  to 
detecting  that  it  has  constructed  a representation  of  the  problem  situation,  that 
it  has  defined  a move  operator,  and  that  it  has  interpreted  the  move  operator  in 
terms  of  the  rep’esentation. 

In  some  real-1  fe  undemanding  situations,  and  in  many  school  situations, 
feedback  is  correspondingly  limited.  In  many  instruction-understanding  situa- 
tions, however,  tht  problem  solver  can  seek  cues  to  help  him  or  her  interpret  the 
problem  situation.  Subjects  solving  the  Monster  Problem  ask  the  experimenter 
such  questions  as:  “Can  a monster  hold  more  than  one  globe*!”  Does  the 
problem  have  a unique  solution?"  "May  a monster  pass  a globe  to  any  other 
monster,  or  only  to  the  monster  standing  nearrsl  to  him*’"  Answers  to  these 
questions  narrow  down  the  ^Aitsible  range  of  interpretations  of  the  problem. 

In  many  other  cases  of  real-world  problem  solving,  the  task  posed  by  the 
problem  instructions  is  to  carry  out  some  external  physical  actions  that  have 
observable  physical  effects.  These  observable  effects  become  another  source  of 
feedback  that  can  be  used  to  aid  problem  interpretation.  When  someone  tries  to 
follow  the  instructions  of  a do-it-yourself  kit,  the  success  or  failure  of  his  actions 
soon  becomes  apparent,  and  this  gives  him  critical  information  as  to  the 
adequacy  of  his  understanding  of  the  instructions.  If  the  instructions  refer  to  a 
particular  component,  and  he  cannot  identify  that  component  from  the  parts 
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lying  in  front  of  him,  he  knows  that  he  does  not  understand  Use  instructions.  If 
the  instructions  require  two  parts  to  be  assembled  that  he  cannot  fit  together,  he 
knows  that  he  does  not  understand  the  instructions. 

In  the  limiting  case  there  are  no  verbal  instructions  at  all  -only  a situation  to 
be  undr -stood  and  dealt  with.  A leaking  faucet  is  an  instruction  “Fix  me  ” This 
limiting  .-sc  reminds  us  that,  in  many  real-life  situations,  information  fo, 
understanding  what  is  to  be  done  is  drawn  leu  fawn  iKphrit  instructions  than 
from  direct  perception  of  the  situation  (It  is  uraw.i  also,  of  course,  from 
long-term  memory,  a point  that  we  shall  discuu  later.). 

In  some  cases,  explicit  instructions  are  accompanied  by  worked-out  examples 
This  is  done  commonly  in  mathematics  textbooks,  computer  progtainming 
manuals,  and  instructions  for  subjects  in  psychological  experiments.  In  these 
situations  the  examples  may  supply  sufficient  information  about  the  task  and  its 
requirements  to  make  the  explicit  task  instructions  superfluous  Thi:  possibility 
has  been  demonstrated  by  Donald  Williams  ( 1972),  who  constructed  an  artifi.~'at 
intelligence  system  that  programs  itself  to  take  various  kinds  ol  intelligence  test 
battenes  (e.g.,  letter  series  completion,  number  analogy)  on  the  basis  of  worked- 
out  examples,  and  without  explicit  verbal  instructions. 

Perhaps  the  most  interesting  cases  are  those  in  which  a subject  receives 
feedback  from  the  conaequences  of  his  attempts  to  formulate  and  solve  the 
problem.  For  example,  a subject  (described  by  Mayes  & Simon.  I9',4)  read  a line 
of  text  and  concluded  that  the  problem  involved  two  participants.  The  next  line, 
however,  implied  that  there  were  three  participants  At  this  point,  the  subject 
carefully  reexamined  each  of  the  sentences  to  resolve  die  conflict  between  them 
before  proceeding  with  his  solution  attempt.  This  example  illustrates  a simple 
but  powerful  heuristic  for  reducing  local  ambiguity  that  is  not  incorporated  in 
the  present  UNDFRSTAND  program  inferences  drawn  from  one  part  of  the 
problem  text  must  be  made  consistent  with  inferences  drawn  from  other  parts  of 
the  text 

Another  example  of  the  use  of  this  heuristic  by  the  same  subject  involved 
interpreting  a statement  of  ihe  form,  "A  Joes  a task  for  B.  In  one  part  l,  i. 
text,  the  subject  interpreted  this  statement  to  mean  tha:  “A  does  a task  to 
benefit  B.”  while  in  another,  he  interpreted  it  to  mean  that  “A  does  a task 
instead  of  B " The  subject's  recognition  and  resolution  of  this  confhct  playec  an 
important  part  in  his  successful  formulation  of  the  problem 
The  consistency  heuristic  is  not  the  only  one  that  subjects  use  in  denving 
feedback  from  their  attempts  to  solve  a problem.  We  have  also  observed  a 
“solvability"  heuristic  being  employed  Briefly,  if  worked  as  follows  the  subject 
hypothesized  an  operator  for  a problem  Then  he  discovered  that  the  problem 
couldn’t  be  solved  with  this  operator.  Me  then  concluded  that  the  operator  was 

inappropriate,  and  hypothesized  a new  one. 

In  summary,  the  task  of  understanding  instructions  when  there  is  environ 
mental  feedback  is  very  different  from  the  understanding  task  in  the  absence  of 
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such  feedback.  Feedback  may  resuit  from  ability  to  question  the  instruction- 
giver,  access  to  the  actual  problem  environment,  access  *o  workedout  examples, 
or  the  use  of  consistency  and  solvability  heuristics.  In  limiting  cases,  the 
information  available  from  these  sources  of  feedback  may  be  so  complete  as  to 
rermit  the  task  to  be  understood  without  any  explicit  instructions  at  all.  Before 
it  can  be  regarded  as  a comprehensive  theory  of  the  understanding  processes,  the 
UNDERSTAND  program  will  have  to  be  augmented  to  handle  these  possibilities 
for  feedback. 


KNOWLEDGE  AND  UNDERSTANDING 

What  does  the  problem  solver  have  to  know  already  in  ordt  r to  understand  a set 
of  problem  instructions'*  The  description  above  o<  the  processes  used  to  under- 
stand the  Monster  Problem  imply  that  the  knowledge  require’itents  tor  under- 
standing are  relatively  limited,  this  kind  of  puzaie  may  be  atypical,  however,  in 
the  slight  demands  it  makes  upon  stored  information 


III  Structured  Problems 

In  the  literature  of  problem  solving,  a distinction  is  often  made  between 
well-structured  problems,  on  the  one  hand,  and  ill-structured  problems,  on  the 
other.  (Newell,  1969.  Reitman.  1964.  1965.  Simon.  1973)  Not  all  authors  have 
defined  the  distinction  in  exactly  the  same  way.  but  a common  theme  running 
through  all  of  the  definitions  is  that  problems  are  ill  structured  to  the  extent 
that  the  problem  solver  hinrelf  must  contribute  toward  their  definition.  Specif- 
ically. we  propose  that  a problem  be  regarded  as  ill  structured  to  the  extent  that 
the  subject  must  build  the  representation  he  uses  to  solve  it  from  information 
generated  dunng  his  unsuccessful  attempts  at  solution. 

The  distinction  between  well-structured  and  ill-structured  problems  describes  a 
continuum  and  noi  a dichotomy  Moreover,  the  ill -structuredness  of  a problem 
may  take  a variety  of  forms  The  problem  instructions  may  br  couched  in 
technical  language  that  can  be  understood  only  by  one  “skilled  in  the  art  Here 
is  an  example,  from  a chemical  engjneenng  textbook,  of  a problem  that  appears 
ill  structured  primanly  tc. cause  of  its  technical  vocabulary 

A thrmding  cdonnwler  i«  attached  lo  a line  conlauun*  Ream  al  IS  po*  •vd>  a 

of  98%.  Whal  does  the  thermometer  in  the  calonmeier  read5 

A problem  may  also  be  ill  structured  because  the  instructions  do  not  contain 
enough  information  to  permit  a usable  problem  representation  to  be  inferred 
from  them  As  a matter  of  fac*  the  chemical  engineering  eiuimple  illustrates  this 
potentiality  for  ill-stnicturednec  also.  There  is  nothing  in  thr  problem  statement 
that  indicates  how  the  temperature  of  steam  can  be  calculated  from  its  pressure 
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sid  quality— no  definitions  of  “move  operators"  are  supplied.  They  mini  come 
from  the  reader*  knowledge.  If  this  knowledge  i*  readily  available  to  the  reader, 
the  prrMem  will  appear  well  itructured  to  him,  if  accessing  the  knowledge  tuelf 
ralk  for  extensive  problem -aolving  effort*,  the  problem  will  appear  ill  itructured 
to  him  (See  Norman.  Centner . & Steven*.  Chapter  9 of  this  volume  > 

A*  one  of  hi*  principal  example*  of  an  ill-structured  problem.  Retunin  (1965. 
Chapter  6)  examined  the  ta*k  of  wnting  a fugue.  Here  the  instruction*  are 
amp  lie  > itself  "compose  a fugue."  Directed  to  a professional  composer,  the 
instruction*  define  a goal,  foe  he  can  recogniae  a fugue  when  he  Itfarsone,  and 
can  apply  a variety  of  teit*  to  evaluate  it*  quality  a*  music  The  in* t ructions  alto 
digest  to  the  composei  a method  of  peoceedmg.  for  they  evoke  from  h« 
long-term  memory  a whole  organised  system  of  compotitional  techniques  If  the 
composer  doe*  not  set  hi*  quality  criterion  too  high,  com  prong  a fugue  may 
even  be  a rathet  routine  activity  from  his  itandpoint.  the  problem  doe*  not 
really  ipoear  to  be  ill  stiuctured  at  all 

Understanding  Pu/rles 

The  examples  • ited  above  show  that  iBstiucturednes*  t*.  in  a certain  sense,  in 
the  eye  of  the  beholder  Perhaps  the  best  way  to  put  it  is  to  say  that  a problem  is 
til  structured  to  the  extent  that  it  put*  demand*  on  the  knowledge  and  repertory 
of  problemsolving  skills  of  the  sober  In  these  terms,  we  would  say  that  the  task 
of  composing  a fugue  is  quite  fat  along  toward  rh>;  ill  structured  end  of  ix 
contsiuum.  not  because  ihe  piofesrvorsal  composer  would  nor  know  hm  to 
proceed,  but  because,  m proceeding  he  would  have  to  draw  upon  a large 
repertory  of  knowledge  and  skill  ilored  in  hi*  : eig-teun  memorv . and  much  of 
jhn  might  become  accessible  only  on  the  course  of  hi*  solution  effort*  for  the 
person  unskilled  in  composition,  the  problem  would  he  ill  itructured  m the  more 
radical  tense  that  he  would  umply  lack  the  knowledge  that  would  per  mi  him 
even  to  formulate  the  problem,  much  less  to  solve  it 
Looking  at  the  matter  of  structuredness  in  t his  light,  we  can  ask  how  well- 
structured  the  Monster  Problem  is  for  the  USDERST AND  program  *hat  stored 
knowledge  does  the  program  draw  upon  m order  to  understand  the  problem’ 
The  UNDERSTAND  program  depends  pci  manly  upon  a know  ledge  of  the 
English  Language  and  recoodarih  upon  a knowledge  of  the  semantics  of  a few 
basis  operators  (in  the  case  ar  hand,  the  MOVi1  opeiatorl  In  addition  to  this 
knowledge  it  has  a general,  task -independent,  capability  for  constructing  repre 
sen  tat  ions  f om  sets  of  objects  and  relations  and  for  inrerpretmt  its  hasic 
operators  in  ternr  of  these  representations  Except  for  function  words  leg 
ptpouiKins.  lonjunctKms.  auxiliaries*  which  seise  as  cues  to  tdenttfs  tlie 
syntactK  structure  the  UNDERSTAND  program  does  roi  need  to  know  the 
meaning  of  English  words,  bur  only  the  parts  of  speech  to  which  they  belong 
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"Mora ter"  could  as  wtU  be  “Jabberwock”  or  “thing*  mxjig”  or  “blurb"  without 
making  the  slightest  diffeirnce  to  the  program’i  operation.  As  we  have  seen 
earlier,  giving  a globe  to  another  monster  (“outgrabmg  a tove  to  a blurb”)  has  to 
be  recognized  as  an  action  that  can  be  matched  to  the  MQVfc  operator.  Beyond 
this,  the  meaning  of  “giving"  or  “outgrabmg"  is  also  irrriev?nt. 

It  is  rather  remarkable  that  this  small  amount  of  machinery  and  knowledge 
enables  the  program  tc  undent  Mid  the  instructions  of  ihe  Monster  Problem.  We 
generally  call  tasks  of  tins  kind  “pu rales."  It  is  perhaps  their  independence  of 
specific  information  not  contained  in  the  problem  statement  that  best  charac- 
terizes them.  However  much  they  are  couched  in  image-inductng  language,  they 
are.  m fact,  hghly  abstract,  and  the  processes  for  undemanding  them  are 
processes  for  identifying  and  further  purifying  the  abstraction.  Monsters  become 
objects  of  a certain  abstrac  dass.  globes,  objects  of  another  class,  “holding”  a 
relation  between  objects  of  the  first  dsns  and  objects  of  the  second,  and  so  on. 
SrmwiiK'  meaning  of  these  terms  is  a “cover."  to  be  stripped  off  in  the  process 
of  understanding  the  task 

The  protocols  of  subjects  attempting  to  understand  the  Monster  Problem 
reveal  the  role  of  abs:*action  in  the  understand!  process.  Some  subjects,  and 
especially  those  who  proved  most  proficient  at  the  task,  were  quite  explicit  in 
casting  out  semantic  irrelevancae*.  One  subject  reads,  “because  of  the  quant ur 
medwmcai  pecularitres  of  their  neighborhood,"  and  comments  at  once,  “Forget 
that  garbage  " Another  subject. after  reading  the  whole  text  once  says 

Now  *rrr  were  oMj  iSofces  *i»o*elWn  T1uf»  mot  thetottaL  They  were  fi.r- 

DMirn  gut  doetn'l  km  mdi  to  4o  with  it.  I take  It.  They  could  ha»e 

horn  n hiudrrt  woMlm  for  I ore 

Two  subject!,  on  the  other  hand,  who  imported  into  the  problem  knowledge 
associated  with  the  meanings  of  terms  failed  to  understand  it.  One  of  these 
subjects  became  preoccupied  with  the  physics  of  the  situation,  lie  made  such 
cofTwnenii  as  “You  can’t  have  two  energy  sutes  at  the  same  time,  according  to 
quantum  physics:  you’re  either  at  one  energy  level  or  another."  and  “the 
medium  wants  to  drop  it’s  energy  leeel  to  small  negative  and  positive -ahh 
energy  levels." 

The  other  subject  who  failed  to  understand  tie  problem  perhaps  sumulated 
by  the  mention  of  “monster  etiquette."  introduced  a social  dimension  into  the 
situation.  He  made  such  remarks  as.  “You  know,  n seems  lo  me  if  they  all 
agreed  on  an  arbitrary  standard  (of  size | . 1 don’t  x*  that  there  would  be  a 
problem  " The  subject  saw  that  such  colhaaon  among  the  monsters  would 
weaken  the  conditions  on  allowable  mows 

Precisely  because  the  Monster  Problem  w»  intended  a a pur/le  the  strategy 
of  making  use  of  the  semantics  of  its  terms  only  caused  <hfroult>  and  impeded 
undentaxfcng  of  the  problem  In  other  kinds  of  problem  utuatiora.  the  same 
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strategy  might  be  helpful,  or  even  essential.  In  the  puzzle  environment,  however, 
the  successful  subjects  were  those  who  applied  abstraction  techniques  like  the 
ones  incorporated  in  the  UNDERSTAND  program. 

The  distinction  beiween  puzzles  and  problems,  like  that  between  structured 
and  ill-structured  problems,  describes  a continuum  rather  than  a dichotomy. 
Some  puzzle  problems  require  the;  use  both  of  semantic  knowledge  and  abstrac- 
tion. The  Jabberwocky  problem  discussed  by  Newell  and  Simon  (1972)  is  just 
such  a problem.  In  this  puzzle  a familiar  river-crossing  problem  is  coded  in 
nonsense  words.  Thus,  “a  heavy  father  and  two  young  sons  have  to  cross  a swift 
river ...”  is  coded  as  “a  sli  thy  tove  and  two  mimsy  borogroves  have  to 
out-wiffle  a frumious  bandersnatch."  Subjects  attempting  to  solve  the  problem 
frequently  interpret  the  string  “to  out-wiffle  a bandersnatch”  to  mean  that  the 
participants  have  to  catch  or  kill  some  creature.  This  interpretation  suggests  an 
operation  that  is  completed  when  any  one  of  the  participants  accomplishes  it, 
rather  than  when  all  three  do  so.  It  is  clear  then  that  the  semantics  of  groups 
crossing  rivers  is  important  for  the  correct  formulation  of  the  problem. 


Semantics  in  Algebra  Word  Problems 

Word  problems  or  story  problems  in  algebra  provide  an  example  of  a problem 
domain  where  semantic  information  may  be  used,  even  though  the  problems  can 
be  understood  with  rather  little  reliance  upon  semantics.  Several  years  ago, 
Bobrow  (1968)  constructed  an  artificial  intelligence  program  for  solving  such 
problems  that  relied  primarily  upon  syntactic  cues.  The  program  operated  in  two 
stages:  first  it  translated  the  story  into  algebraic  equations;  then  it  solved  the 
equations. 

Bobrow’s  program  needed  to  be  able  to  interpret  a small  number  of  mathe- 
matical terms,  for  example,  “equals”  and  its  synonyms,  “four  times  as  great  as, 
“less  than,”  “twice  as  many,”  the  names  of  the  numerals,  and  so  on.  It  also  had 
some  special  semantic  capabilities  for  handling  age,  rate,  and  distance  problems. 
Beyond  this,  it  depended  entirely  upon  syntactic  means  for  its  translations. 
Hence,  its  semantics  was  not  much  more  elaborate  than  the  semantics  currently 
incorporated  in  the  UNDERSTAND  program. 

Subsequently,  Paige  and  Simon  (1966)  examined  the  protocols  of  human 
subjects  working  algebia  word  problems  to  see  whether  the  processes  they  used 
resembled  those  incorporated  in  Bobrow’s  program  They  constructed  a number 
,jf  problems  that  represented  physically  impossible  situations.  For  example. 

The  number  of  quarter!  a man  has  is  seven  times  the  number  of  dimes  he  has.  The  value 
of  the  dimes  exceeds  the  value  of  the  quarters  by  two  dollars  and  fifty  cents.  How  Many 
has  he  of  each  coin? 

Subjects  responded  in  three  distinct  ways.  Some  subject.,  made  a literal  (syn- 
tactic) translation  of  the  problems.  In  the  example  quoted  above,  they  arrived  at 
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the  equation 


IQ*  = 250  + 25<7JQ. 


Other  subjects,  apparently  noting  that  the  value  of  the  quarters  must  be  greater 
than  the  value  of  the  dimes,  simply  assigned  the  extra  two  and  a half  dollars  to 
the  other  side  of  the  equation  Their  solution  made  sense  of  the  original 
problem,  but  did  nr.  represent  an  accurate  translation  of  the  problem  statement. 
A third  group  of  u'ajects  objected  that  they  found  the  problem  contradictory 
They  could  not  r n ondle  their  (accuratt)  translation  of  the  problem  statement 
with  their  interpretation  of  the  physical  re.  'jty  of  the  situaix>n 
This  r ..iment  demonstrated  a conndei.:ble  variation  among  human  subject: 
in  their  relative  reliance  upon  syntactic  and  semantic  cues  in  doing  algebra  word 
problems  It  also  shows  that,  when  the  subject’s  semantic  knowledge  is  taken 
into  account,  many  problems  have  considerable  redundancy  Different  under- 
standing processes  may  accomplish  their  task  by  making  use  of  different  parts  of 
the  redundant  information.  Mence,  from  the  fact  of  understanding,  one  cannot 
inter  a unique  prog  im  that  brought  about  that  understanding 


Styles  in  Understanding 

In  the  complex  tasks  handled  by  professionals,  only  a small  pari  of  the  task 
information  comes  from  explicit  instructions  The  vast  bulk  of  this  information 
is  retrieved  from  the  professional’s  long-term  memory  , where  training  and 
experience  have  placed  it.  A few  studies  have  begun  to  disdose  the  nature  of  this 
professional  knowledge,  and  how  it  is  used  in  proofem  solving.  Clarkson  ( I <*63) 
simulated  the  professional  decision  making  of  a bank  trust  >fficei.  wrhtle  the 
decision  making  or  chess  masters  has  been  studied  by  de  Grout  <1965,  1966). 
Newell  and  Simon  (1972)  and  Chase  and  Simon  1 1973a.  b).  Ike  will  need  rruny 
more  such  studies,  over  a whole  range  of  professions,  to  explore  and  characterize 
the  variety  of  understanding  processes  used  by  professionals 
Parallel  to  the  differences  among  problem  domains  are  differences  in  the 
strategies,  or  styles,  that  problem  solvers  may  employ  in  icekmg  to  under****;.' 
instructions  The  success  of  a particular  style  may  depend  cm  its  appropriate 
to  the  problem  domains  to  which  it  is  applied  For  purposes  of  discussam.  *e 
may  consider  a style  that  emphasizes  abstraction  processes,  on  ih«tempha»rr 
trieval  from  semantic  memory  . and  one  that  emphasizes  meta.Stor  and  eaeljg- 
In  the  previous  section,  the  first  two  of  these  styles  have  already  'ecer  *4  «onx 
attention.  We  saw  that  success  in  understanding  a pu* tie  ’ike  pfurdem  epends 
on  skill  in  abstracting  and  in  disregarding  irrelevant  setnmti.'  mierprei”  » is  «*• 
the  problem  vocabulaij  On  the  other  hand,  ve  saw  (hat  more  cuptpux  (*«** 
lems  are  generally  inscrutable  without  semantic  imerpretation  The  tla'd  style 
that  employing  analogy  and  metaphor  needs  dime  additional  dthrutfem 
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requires  that  there  be  some  redundancy  between  the  two  passages.  In  extolling 
the  values  of  brevity,  manuals  on  good  writing  do  not  generally  make  explicit 
the  distinction  between  eliminating  irrelevancy  and  eliminating  redundancy. 

The  theory  alerts  us  to  the  semantic  information  that  is  needed  to 
interpret  instructions  successfully,  and  suggests  an  approach  toward  systematic 
identification  of  that  information  by  analysis  of  the  problem  structure.  The 
instructions  can  then  be  tested  for  completeness— to  see  whether  they  contain 
the  semantic  information  that  the  reader  cannot  be  assumed  to  have  already 
available.  For  example,  Bobrow’s  program  for  algebra  word  problems  could  not, 
if  it  were  deprived  of  the  formula,  D = R X T,  solve  rate  problems.  Constructing 
an  artificial  intelligence  program  to  perform  the  task  would  provide  an  iventory 
of  the  semantic  information  required  for  performance.  Similarly,  submitting  a 
set  of  task  instructions  to  the  UNDERSTAND  program  would  reveal  the  seman- 
tic information  needed  for  understanding  those  instructions. 

Thus,  not  only  might  theory  illuminate  the  nature  of  clarity  in  writing,  and 
ways  of  attaining  it,  but  artificial  intelligence  programs,  through  their  further 
development,  could  be  used  to  analyse  the  structure  of  particular  classes  of 
problem  environments,  and  to  “debug”  instructions  by  identifying  points  of 
difficulty  in  them. 


Teaching  the  Skills  of  Understanding 

It  is  unrealistic  to  think  that  we  can  make  more  than  a dent  in  the  problem  of 
functional  illiteracy  by  simplifying  and  otherwise  improving  the  quality  of  oral 
and  written  instructions.  At  the  same  time  we  seek  to  improve  the  performance 
of  the  speaker  or  writer,  we  must  seek  to  increase  the  ability  of  the  listener  or 
reader  to  comprehend  complex  prose.  On  the  basis  of  the  theory  of  understand- 
ing set  forth  here,  we  can  sketch  out  some  plausible,  but  untested,  ideas  as  to 
how  this  job  might  be  tackled. 

To  the  extent  that  understanding  problem  instructions  involves  semantic 
knowledge  abiut  a problem  domain,  there  is  no  substitite  for  having  tne 
requisite  knowledge.  A person  cannot  interpret  algebra  word  problems  unless  he 
knows  v hat  suji  phrases  as  “twice  as  many”  moan.  Training  in  understanding 
problem  instructions  will  not  teach  him  this;  training  in  arithmetic  or  algebra 
might  v 

Where  semantic  information  is  involved,  learning  to  understand  problem  in- 
structions cannot  be  separated  from  learning  the  subject  matter  of  the  problems. 

To  the  extent  that  functional  illiteracy  stems  from  lack  of  such  knowledge,  no 
amount  of  training  in  how  to  read  or  listen  will  remove  it. 

But  there  is  no  reason  to  suppose  that  all  inadequacies  in  interpreting  problem 
instructions  stem  from  deficiencies  in  subject  : natter  knowledge.  In  the  example 
of  the  Monster  Problem,  we  saw  that  some  subjects  failed  to  arrive  at  a suitable 
problem  representation,  not  because  they  lacked  information,  but  because  they 
dragged  in  irrelevant  information  instead  of  abstracting  out  the  essential  problem 


i 


14.  UNDERSTANDING  COMPLEX  INSTRUCTIONS  285 


elements.  Their  difficulties  might  have  been  reduced  by  appropriate  training  in 
the  skills  of  interpreting  instructions. 

A first  topic  of  instruction  might  be  styles  of  interpretation.  An  awareness 
could  be  developed  of  the  need  to  select  a style  of  attack  appropriate  to  the 
problem  domain,  and  practice  could  be  given  in  changing  style  in  response  to 
cues  imbedded  in  the  problem  instructions. 

A second  topic  of  instruction  might  be  the  abstractive  style  itself.  The 
UNDERSTAND  program  provides  a framework  for  attacking  instructions  by 
abstraction.  Practice  could  be  given  in  identifying  important  sets  of  objects 
mentioned  in  the  problem  instructions,  identifying  relations,  constructing  a 
representation  of  the  situation,  identifying  the  operators  and  conditions. 

A third  topic  of  instruction  might  be  training  in  the  style  that  maximizes  the 
use  of  semantic  cues  and  analogy.  At  the  moment,  we  have  little  to  suggest 
about  the  specifies  of  doing  this. 

A fourth  topic  of  instruction  might  be  the  skills  of  obtaining  feedback  from 
the  instruction  giver  or  from  the  task  environment.  Practice  could  be  given  in 
clarifying  task  instructions  by  asking  questions,  by  making  solution  attempts  to 
identify  ambiguities,  by  searching  for  and  exploiting  redundancies  in  the  instruc- 
tions themselves.  The  field  of  computer  programming  might  be  a useful  and 
appropriate  domain  within  which  to  practir  .h  skills.  Programming  problems 
can  be  proposed  at  any  desired  level  of  difficulty  and  clarity.  Programming 
manuals  are  inexhaustible  sources  of  unclear  (and  occasionally  clear)  problem 
instructions.  The  computer  itself  provides  a real-world  environment  in  which 
understanding  can  be  tested  and  feedback  obtained.  The  idea  of  using  computer 
programming  as  a domain  for  teaching  the  skills  of  understanding  instructions  is 
not  unrelated  to  the  proposal  and  experiments  of  Papert  and  Minsky  (see  Papert, 
1971),  who  use  programming  as  a domain  for  teaching  general  problem-solving 
skills.  (See  also  Norman,  Gentner,  & Stevens,  Chapter  9,  this  volume). 

No  doubt  these  are  only  a fraction  of  the  possibilities  for  specific  instructional 
plans  for  raising  the  skills  of  understanding  instructions.  As  we  acquire  more 
adequate  theories  of  the  understanding  process,  our  abilities  to  construct  effec- 
tive training  procedures  should  increase.  In  this  chapter  we  have  tried  only  to 
suggest  some  general  lines  of  attack  on  the  problem;  we  are  under  no  illusion  that 
we  have  solved  it.  It  is  well  worth  solving,  because  it  is  deeply  implicated  in  the 
extent  and  severity  of  functional  illiteracy  in  our  society. 
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Education  and  Understanding 
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Education  has  typically  dealt  with  the  problem  of  understanding  only  in  terms  of 
ITnXicXA,  to,  i,  » »,  of  leambig  which  go  w.^to., 

printed  strings  of  letters.  The  two  chapters  by  Just  and  Carpenter,  Chapter  13  and 
Ey  Simon  and  Hayes,  Chapter  14,  deal  with  the  problem  of  nn 
two  quite  different  ways,  both  distinc,  from  the  mechanics  of  reading.  If  Simon 
and  Hayes  are  correct  that  the  difficulties  adults  have  in  reading  are  deeper  than 
to  SL  level,  ton  surely  it  would  be  worthwhile  to > «* 
understanding  skills  as  well  as  reading  skills.  So  if  « is  possible  to  ana^ype 
understanding  in  deeper  terms  than  the  mechanics  of  reading,  the  implications 

for  education  should  be  important. 

Just  and  Carpenter  deal  with  the  problem  of  understanding  principally  at  the 
level  of  the  individual  sentence.  The  data  they  present  to  support  their  model  of 
sentence  processing  is  very  convincing.  Underlying  their  model  is  a companson 
S process,  wfcch  they  treat  as  a simple,  unitary  step  that  can  be 
repeated  different  numbers  of  times.  1 suspect  Aeir  companson  model, 
basically  correct  and  I have  argued  elsewhere  (Collins  & Quilhan,  1972)  th 
such  a companson  process  pervades  all  of  human  language  processing. 

In  the  tasks  Just  and  Carpenter  have  used,  the  comparison  process  vv°uld 
quite  simple,  and  assigning  it  a fixed  duration  as  they  do  makes  sense.  But 
reader  should  not  be  misled  into  thinking  that  it  is  always  a s.mple  process  or 
that  it  has  a fixed  duration  in  general.  This  could  be  shown  even  in  the  bnck  of 
tasks  Just  and  Carpenter  have  been  using.  For  example,  consider  the  task where 
a sentence  such  as  “The  dots  are  red”  is  compared  against  a picture  of  red 1 dot 
One  manipulation  'hat  should  affect  the  duration  of  the  companson  process  is 
the  perceptual  similarity  of  the  color  in  the  sentence  and  in  the  P>c  uJ^  f t^ 
dots  are  red,  for  example,  sentences  such  as  “The  dots  are  purple  or  The  dots 
are  pink”  should  take  longer  to  reject  than  “The  dots  are  yellow  This  kind  of 
example  can  be  extended  into  the  semantic  domain.  If  a picture  has  red  do  , 
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verifying  a sentence  like  “The  circles  are  red”  or  “The  squares  are  red”  involves 
deciding  whether  dots  can  be  circles  or  squares.  Similarly  if  the  question  is 
whether  “The  dots  are  crimson,”  “The  dots  are  maroon,”  or  “The  dots  are 
colored,”  this  involves  deciding  whether  crimson,  or  maroon,  or  colored  can  be 
the  same  as  red.  The  point  is  simply  that  the  comparison  process  itself  can  be 
complex,  involving  subprocesses  of  different  durations.  Just  and  Carpenter’s 
fine-grain  analysis  of  processing  may  be  extendable  to  analysis  of  the  comparison 
process  itself. 

There  is  one  other  question  I might  address  with  respect  to  Just  and  Car- 
penter’s chapter:  that  is,  are  they  dealing  with  the  real  problems  of  under- 
standing? The  answer  is,  l think,  that  they  are  dealing  with  some  of  them.  But 
there  are  other  even  more  difficult  problems  that  they  have  avoided.  The 
assumptions  of  their  verification  studies  is  that  the  relevant  knowledge  is  directly 
stored  as  a single  entity.  In  their  studies  this  is  so,  because  they’ve  made  it  so, 
but  in  real  life  it  is  usually  not  the  case.  Often  answers  must  be  inferred  from 
several  pieces  of  knowledge  scattered  about  in  memory.  Even  when  Just  and 
Carpenter  analyze  reading  comprehension  t;sts,  they  have  not  tried  to  deal  with 
how  the  reader  relates  the  information  in  a paragraph  to  his  various  kinds  of 
knowledge  about  the  world.  These  are  probelms  that  are  probably  beyond  the 
scope  of  the  fine-grained  analysis  which  Just  and  Carpenter  are  using.  But  they 
are  susceptible  to  the  kind  of  analysis  that  Simon  and  Hayes  are  attempting  in 
their  chapter.  Simon  and  Hayes  analyze  what  it  means  to  understand  the 
statement  of  a problem.  Their  discussion  of  ill-structured  and  well-structured 
problems  directly  addresses  the  issue  of  how  much  knowle  dge  a reader  brings  to 
bear  in  understanding  a problem.  It  is  perhaps  one  of  the  most  important  points 
in  the  chapter.  As  Simon  and  Hayes  point  out,  a problem  is  ill  structured  to  the 
degree  a person  needs  to  use  knowledge  beyond  that  which  is  in  the  statement  of 
the  problem,  either  to  understand  or  solve  it.  The  distinguishing  feature  of 
problems  like  the  monster  problem  they  have  been  working  with  is  that  relatively 
little  special  knowledge  is  needed  to  understand  the  problem  It  is  a well- 
structured  problem. 

Is  life  like  a monster  problem?  Again  1 want  to  be  wishy-washy  and  say  yes 
and  no.  I think  people  spend  much  of  their  time  problem  solving,  often  at  a 
subconscious  level.  For  example,  problem  solving  turns  up  when  people  try  to 
answer  questions  to  wh'ch  they  do  not  have  prestored  answers  (Collins,  War- 
nock,  Aiello,  & Miller,  1975).  Therefore,  one  of  the  aims  of  education  should  be 
to  teach  people  how  to  understand  and  solve  problems  as  effectively  as  possible. 
This  attitude  reflects  the  views  of  Simon  and  Hayes  and  Papert  (1972)  of 
education.  But  there  is  a difference  between  the  two  views.  Papert  tends  to 
discount  the  teaching  of  factual  knowledge  as  a legitimate  goal  of  education, 
whereas  Simon  and  Hayes’  view  stresses  the  importance  of  knowledge  for  dealing 
with  ill-structured  prooiems.  It  turns  out  that  most  of  life’s  problems  are  ill 
structured,  and  so  acquiring  and  using  factual  knowledge  is  crucial  to  under- 
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standing  most  problems.  It  is  because  of  this  that  life  is  not  like  a monster 
problem. 

Consider  the  kinds  i.i  real-world  difficulties  in  understanding  that  Simon  and 
Hayes  cite  at  the  beginning  of  their  chapter.  People  cannot  understand  instruc- 
tions on  soup  cans  because  they  do  not  have  enough  knowledge  about  cooking 
of  the  kind  one  acquires  from  practice,  or  knowledge  about  the  basic  terms  in 
cooking  like  “simmer”  or  “colander,”  or  knowledge  about  the  structure  of 
recipe  instructions  that  one  acquires  from  reading  recipes.  Alternatively,  con- 
sider the  problem  of  understanding  traffic  tickets.  There  Lie  difficulty  in 
understanding  may  derive  from  the  use  of  legal  language  and  concepts.  In  such 
cases  understanding  may  involve  knowing  what  to  ignore,  as  Simon  and  Hayes 
point  out  can  be  important  in  monster  problems.  But  knowing  what  to  ignore 
here  requires  a primitive  knowledge  of  law 

These  examples  emphasize  the  fact  that  monster  problems  pr  >bably  are  a good 
place  to  approach  the  problem  of  understanding,  because  th  semantic  knowl- 
edge needed  to  understand  them  is  relatively  limited.  But  the  difficulties  in 
understanding  soup-can  instructions  and  traffic  tickets  will  not  be  solved  by  a 
deep  analysis  of  monster  problems,  because  the  difficulties  arise  in  having 
enough  knowledge  about  the  world  and  using  it  appropriately  to  fill  in  the 
information  that  the  text  assumes. 

The  important  conclusion  from  all  this  is  that  there  is  no  easy  way  to  educate 
people  to  understand.  Because  life  is  full  of  ill-structured  problems  of  the  kind 
that  soup  cans  and  traffic  tickets  present,  we  need  to  have  a huge  amount  of 
world  knowledge,  together  with  the  kind  of  understanding  and  problem-so  ving 
skills  that  Simon  and  Hayes  or  Papert  advocate.  Papert’s  viewpoint  should 
probably  be  stressed,  because  school  teachers  have  only  tried  to  impart  world 
knowledge  and  not  these  other  skills.  3y  deemphasizing  the  teaching  of  world 
knowledge,  educators  may  be  induced  to  strike  a more  even  fiance.  But  in  the 
end  there  is  no  way  to  teach  people  to  understand  soup-can  instructions  without 
teaching  them  a lot  of  things  about  cooking. 
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I am  in  essential  agreement  with  Collins  (Chapter  15,  this  volume)  that  Chapter 
13  by  Just  and  Carpenter  is  a fine  piece  of  work  on  a very  difficult  problem. 
Therefore,  having  no  major  criticism  to  make,  1 address  my  commen  s 
broader  question  of  the  relevance  of  research  such  as  theirs  to  other  prob  em 
areas  of  psychology.  I would  like  to  describe  some  of  my  applied  research 
experience  in  aphasia  and  art  instruction  in  order  to  demonstrate  how  a consis- 
tent underlying  theoretical  orientation  has  lead  us  to  applied  instructional 
programs  in  two  widely  diverse  areas. 


TREATMENT  OF  APHASIA 

The  most  impressive  aspect  of  the  Just  and  Carpenter  work  is  the  fact  that  a very 
simple  model* seems  to  account  for  processing  latencies  *'™l«'**£* 
situations.  1 must  admit  that  I smarted  a little  upon  first  reading  their  con  nbu^ 
tion  because  I saw  that  they  had  succeeded  in  finding  a reasonably  simple 
linguistic  processing  model  for  various  sentential  transforms  where  attempts i at 
Minnesota  had  met  with  much  less  success  (Clifton,  Kurcz,  & Jenkins,  6 , 
Clifton  & Odom,  1966;  Walls,  1968). 

In  1966  Terry  Halwes  and  I developed  a partial  transformational  grammar  .or 
English  that  produced  the  structural  descriptions  for  elliptical  sentences.  In  a 
our  system  constituted  an  early  attempt  to  develop  a “question-answer 
mode’l  that  might  ultimately  be  used  as  part  of  natural  language  or  converse 
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tional  computer  program.  An  explanation  is  in  order  since  the  experimentation 
motivated  to  test  this  model  was  not  unlike  that  done  by  Just  and  Carpenter.  My 
primary  purpose  is  to  suggest  some  ways  in  which  basic  psycholinguistic  research 
in  general  may  be  fruitfully  applied  to  the  study  and  treatment  of  communica- 
tion, such  disorders  as  aphasia,  in  which  normal  language  processes  have  some- 
how become  dysfunctional. 

Even  though  our  success  in  applying  psycholinguistic  techniques  to  the  study 
of  aphasia  may  have  been  slight,  it  may  encourage  such  other  researchers  as  Jusi 
and  Carpenter  to  apply  their  more  adequate  techniques  to  the  study  of  such 
language  disorders.  My  main  point  is  that  the  treatment  of  such  disorders 
necessarily  involves  assumption  regarding  the  nature  of  instruction,  or,  more 
precisely,  ^instruction.  Speech  therapy  is  a form  of  reinstruction  of  people  with 
respect  to  a skill  once  possessed  but  now  lost,  and  as  such  is  indeed  a problem 
for  instructional  psychologists.  For  only  to  the  extent  that  instruction  is  under- 
stood can  techniques  for  reinstruction  realistically  hope  to  improve.  Thus,  the 
topic  examined  in  this  volume,  cognition  and  instruction,  is  vitally  important  to 
nearly  all  forms  of  clinical  therapy  in  which  restoration  of  a lost  skill  is  the  goal. 

As  is  so  often  the  case  in  basic  research,  Terry  Halwes  and  I did  not  even 
consider  the  potential  relevance  of  our  work  to  instruction  or  reinstruction  when 
we  were  developing  our  grammar  of  elliptical  sentences.  Perhaps  we  should  have, 
but  we  did  not.  By  relating  our  experience  1 hope  that  other  researchers  may  be 
made  more  circumspect  regarding  the  relevance  of  their  work  to  serious  applied 
problems.  I believe  that  volumes  such  as  this  one  surely  help  us  all  take  a much 
needed  step  in  that  direction. 

Halwes  and  I borrowed  from  the  transformational  grammars  that  were  then 
under  development  by  Chomsky  (1965)  and  Lees  (1960).  Our  grammar  differed 
from  theirs,  however,  in  that  it  reputed  to  capture  the  relationship  of  a 
declarative  sentence  to  all  the  WH  questions  that  might  be  asked  about  the 
content  of  the  sentence  as  well  as  all  the  elliptical  answers  that  might  be  given. 
For  instance,  consider  the  sentence.  The  little  zebra  nimbly  jumped  over  the 
stream , Restricting  ourselves  to  just  the  WH  questions,  some  of  ;he  questions 
that  might  be  asked  by  someone  who  failed  to  process  the  sentence  successfully 
when  first  presented  for  whatever  reasons  (distracted,  partially  deaf,  retarded,  a 
nonnative  speaker,  etc.),  may  ask:  What  did?  Did  what?  Which  Zebra?  Jumped 
H'/iere?  Over  what?  How?  etc.  The  corresponding  elliptical  responses  to  these 
ellipsizing  questions  might  be  offered:  The  zebra.  Jumped.  The  little  one.  Over 
the  stream  Nimbly.  Our  grammar  showed  that  the  same  transformations  used  to 
derive  a whole  sentence  could  tag  certain  sentential  constituents  in  such  a 
manner  that  either  of  two  kinds  of  transformations  might  be  applied  to  the  deep 
structure  forms,  an  ellipsizing  question  or  a correct  elliptical  answer  to  that 
question,  in  other  words,  an  “erasure”  transformation  applied  in  a compli- 
mentary fashion  to  a given  sentence  to  produce  either  a desired  question  or  a 
desired  answer.  Our  theory  established  what  we  called  the  transformational 
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relationship  of  grammatical  complimentarity  between  ellipsizing  questions  and 
the  elliptical  answers  they  evoked.  All  this  was  achieved  with  a minimal  modifi- 
cation to  Chomsky’s  (1965)  version  of  the  transformational  grammar  of  English 
and  without  destroying  any  of  the  desirable  properties  that  sue.,  a grammar  must 
have  if  it  is  to  contribute  to  a theory  of  language  comprehension. 

To  initially  test  our  grammar  we  showed  that  it  could  genente  over  98%  of  all 
the  elliptical  sentences  given  by  a large  number  of  adult  nonnal  subjects  when 
asked  the  total  set  of  possible  ellipsizing  questions  about  a wide  variety  of 
sentences  of  various  lengths  and  degrees  of  syntactic  complexity.  Later,  given 
the  success  of  this  initial  test  of  our  model,  we  attempted  to  determine  if  a 
similar  technique  might  not  shed  light  on  the  problem  of  how  linguistic  knowl- 
edge is  represented.  More  specifically,  we  wanted  to  determine  the  extent  to 
which  our  knowledge  of  sentences  was  represented  in  a form  possessing  some 
degree  of  abstract  similarity  to  their  transformational  description.  By  using  the 
ellipsizing  question  as  a probe,  we  hoped  to  show  that  the  latency  of  producing 
an  appropriate  elliptical  answer  was  proportional  to  the  number  of  operations 
used  in  its  derivation  by  the  grammar.  If  this  proved  to  be  the  case  then, 
following  a line  of  argument  made  popular  by  Chomsky,  our  modified  transfor- 
mational grammar  (mostly  Chomsky’s  and  Lee’s,  that  is)  would  be  shown  to  be  a 
valid  model  of  how  linguistic  knowledge  is  represented. 

To  this  end,  with  Virginia  Walls  (1968)  we  developed  what  we  called  the 
ellipsis  production  task  (EPT),  which  is  carried  out  in  the  following  way.  A 
subject  is  given  a deck  of  cards  with  a single  sentence  on  each  card.  The  subject 
memorizes  the  sentence  on  a given  card  turned  up  in  front  of  her.  The  subject 
then  turns  the  card  face  down  and  the  experimenter  asks  an  ellipsizing  question 
about  the  sentence  just  memorized  which  she  is  to  answer  immediately.  A voice 
key  is  tripped  by  the  offset  of  the  experimenter’s  voiced  question  and  again  by 
the  onset  of  the  subject’s  vocal  response.  Hence,  a measurement  is  taken  of  the 
time  required  for  the  subject  to  process  the  question. 

Our  primary  assumption  was  that  questions  probing  more  complex  answers 
embedded  deepest  in  the  sentence  would  require  the  most  time  to  process.  In 
contrast,  elliptical  answers  whose  derivations  were  simpler  should  take  less 
processing  time  to  retrieve.  Roughly,  our  hypotheses  were  borne  out  according 
to  a simple  count  of  the  number  of  rules  needed  to  derive  the  elliptical  response 
as  predicted  by  our  transformational  grammar  for  elliptical  sentences.  Unfor- 
tunately, the  derivational  “distance”  metric  selected  predicted  less  and  less  well, 
the  more  complicated  the  predicted  derivation.  It  is  here  that  the  Just-Carpenter 
model  seems  superior  to  our  earlier  attempt  to  measure  linguistic  processing 
times. 

Later,  while  working  at  the  Aphasia  Clinic  at  the  Minneapolis  Veteran  s 
Administration  Hospital  with  the  great  aphasiologist,  Hildred  Schuell,  William 
Brewer  and  1 attempted  with  some  success  to  apply  the  EPT  and  other  psycho- 
linguistic  techniques  to  the  study  of  the  aphasic  communication  disorder. 
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However,  owing  to  the  untimely  death  of  Hildred  Schuell,  the  research  project 
was  never  fully  completed.  Some  tentative  results,  however,  can  be  culled  from 
these  efforts. 

As  is  well  known,  it  is  very  difficult  to  get  reliable  measures  of  linguistic 
processing  latencies  from  aphasic  patients  because  the  time  it  takes  them  to 
produce  any  given  utterance  at  any  given  time  is  highly  erratic,  thus  giving  an  error 
term  that  renders  any  statistic  virtually  useless.  However,  their  pattern  of  errors 
on  linguistic  performance  does  seem  quite  reliable  over  tasks.  Roughly,  we  found 
that  the  errors  produced  by  adult  aphasic  patients  in  either  producing  sentences 
from  given  words,  repeating  sentences  presented  visually  or  orally,  or  in  answer- 
ing questions  about  sentences  available  in  front  of  them  (a  modified  version  of 
the  EPT)  were  correlated  with  the  syntactic  complexity  of  the  sentence  or  the 
deriviational  relationships  among  sentences-questions-answers.  This  suggests 
that  a correlation  should  exist  between  the  latency  measures  exhibited  by 
normal  native  speakers  in  processing  sentences  and  the  error  measures  exhibited 
by  aphasic  patients  on  similar  tasks  (Schuell,  Shaw,  & Brewer,  1969;  Sefer  & 
Shaw,  1972). 

The  attempt  to  apply  psycholinguistic  and  cognitive  principles  to  the  study, 
diagnosis,  prognosis,  and  treatment  of  aphasic  patients  forced  us  to  rethink 
seriously  what  we  believe  to  be  the  nature  of  normal  cognition  and  the  role  it 
plays  in  normal  communication.  In  Schuell’s  Aphasia  in  Adults  (Jenkins, 
Jiminez-Pabon,  Shaw,  & Sefer,  1975),  we  present  a functional  schema  for 
cognition  that  organizes  the  contribution  of  all  the  various  psychological  com- 
ponents we  believe  necessary  to  norma!  communication.  Our  theory  of  aphasia 
and  other  communication  disorders  provides  a characterization  of  each  type  of 
communication  disorder  in  terms  of  the  dysfunctional  relationships  or  inter- 
actions among  these  necessary  components  as  a result  of  trauma  to  the  system. 
In  other  words,  in  this  book  we  have  made  a serious  effort  to  define  the  norm 
from  which  aphasia  and  other  disorders  deviate  and  to  precisely  characterize  that 
deviation.  Furthermore,  it  is  with  respect  to  this  cognitive  approach  to  com- 
munication and  communication  disorders  that  I feel  the  Just-Carpenter  model, 
as  well  as  the  models  in  the  chapters  by  Simon  and  Hayes  and  by  Shaw  and 
Wilson  have  particular  relevance.  In  isolation  one  may  indeed  question  their 
relevance  to  current  instructional  problems  in  either  the  clinical  or  classroom 
setting.  However,  when  taken  together,  their  potential  relevance  to  the  realiza- 
tion of  a general  cognitive  model  for  communication-something  that  is  obvi- 
ously indispensable  to  the  development  of  a true  science  of  instructional  design- 
becomes  obvious  to  even  the  most  applied  psychologist. 

The  potential  relevance  of  basic  research  in  cognitive  psychology  to  applied 
problems  is  illustrated  in  the  case  of  the  generative  approach  to  conceptual 
knowledge  and  the  techniques  of  stimulative  therapy  developed  by  Schuell. 
Therapy  can  be  considered  a form  oft  ,'truction  when  the  latter  concept  is  as 
broadly  construed  as  1 feel  it  should  be.  The  stimulative  method  of  construction 
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whether  applied  in  a classroom  program  to  teach  or  improve  language  facility  of 
normal  speakers,  or  in  a clinical  program  to  treat  aphasics,  is  based  upon  a sound 
pedagogical  principle,  namely,  the  principle  that,  in  general,  one  should  stimu- 
late rather  than  correct.  The  idea  Schuell  repeatedly  proved  in  her  clinic  is  that 
defective  responses  tend  to  disappear  automatically  as  language  functions  in- 
crease. Thus,  the  time  of  the  clinician,  like  that  of  the  teacher,  is  better  spent  in 
stimulating  the  proper  use  of  all  language  modalities  rather  than  in  trying  to 
force  patients  to  modify  erroneous  responses  made  sporadically  along  the  way. 

The  success  of  the  stimulative  approach  depends  upon  the  existence  of  what 
Wilson  and  1 have  termed  the  “generative  cognitive  capacity”  for  the  following 
reasons.  Language  is  learned  originally  in  childhood  from  experience  with  a set 
of  linguistic  structures  that  is  much  smaller  and  less  systematic  than  the  dynamic 
whole  finally  achieved  by  adulthood.  The  clinician,  no  less  than  the  teacher, 
must  discover  what  language  functions  (vocabulary,  syntactic  forms,  sound 
patterns,  etc.)  the  patient  or  student  has,  and  use  these  as  the  material  by  which 
to  stimulate  or  restimulate,  as  the  case  may  be,  the  tognitive  processes  and 
physiological  synergisms  required  for  normal  language  processing.  For  instance, 
the  residual  language  remaining  for  most  adult  aphasic  patients  will  be  front  that 
area  of  knowledge  most  closely  related  to  the  patient’s  job  and  home  environ- 
ments. The  words,  concepts,  and  sentential  structures  that  are  most  salient  in 
those  subcultural  contexts  will  probably  contribute  the  greatest  amount  to  the 
patient’s  residual  language.  If  the  patient  is  a farmer,  then  his  residual  language  is 
most  likely  to  be  about  farming;  if  a doctor,  it  is  about  the  practice  of  medicine; 
if  a lawyer,  it  is  about  the  practice  of  law,  and  so  forth. 

Given  both  the  logical  necessity  and  the  empirical  evidence  for  the  existence  of 
a generative  cognitive  capacity  as  discussed  earlier  (Shaw  & Wilson,  Chapter  10 
of  this  volume),  what  could  be  more  natural  than  stimulative  therapy?  Here 
simple  exemplary  sentences  constructed  from  the  patient’s  residual  vocabulary 
are  given  to  him  for  practice.  The  materials  should  not  be  so  easy  as  to  require 
no  effort  nor  so  difficult  as  to  be  frustrating.  Rather  they  should  be  so  suited  to 
the  patient  at  a given  time  as  to  allow  for  a moderate  degree  of  success.  Thus,  in 
this  way  the  patient  is  allowed  to  build  from  success  to  success  at  gradually 
increasing  levels  of  linguistic  complexity.  Obviously,  then  what  is  sorely  needed, 
if  such  therapy  is  to  be  successful,  is  as  precise  a metric  as  possible  for  measuring 
the  difficulty  of  the  material.  It  is  here  that  the  demand  for  techniques  for 
measuring  linguistic  processing  latencies  and  linguistic  processing  errors  converge 
with  the  need  for  a technique  to  select  the  set  of  exemplars  to  be  practiced. 

The  sentences  constructed  from  the  residual  vocabulary  of  an  aphasic  patient 
and  given  to  him  for  practice  must  be  selected  to  be  both  exemplary  and  of 
adequate  difficulty,  if  they  are  to  stimulate  the  trauma  tically  deranged  cognitive 
system  to  reequilibrate  along  the  same  lines  as  the  functional  organization 
existing  before  the  trauma.  Since  language  is  itself  a generative  system  built  up 
originally  from  practice  with  perceiving  and  producing  exemplars,  restimulation 
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during  the  course  of  therapy  presumably  succeeds  only  if  the  cognitive  system 
retains  sufficient  generative  capacity  to  become  functionally  restructured  by 
generalizing  from  the  exemplars  experienced. 

Thus,  the  generative  model  for  concept  learning  also  provides  an  explanation 
for  why  stimulative  therapy  should  work  at  all.  Moreover,  this  model  provides  a 
new,  more  dynamic  theory  to  supplant  the  traditional,  essentially  static  theory 
offered  by  stimulus-response  psychology  for  both  positive  transfer  and  so-called 
“stimulus  generalization.”  Tire  wedding  of  this  generative  model  with  stimulative 
therapy  is  testimony  to  the  proposition  that  a symbiotic  relationship  can  exist 
between  pure  and  applied  psychology. 


THE  ROLE  OF  PERCEPTION 
IN  GENERATIVE  INSTRUCTION 

1 believe  perception  is  the  epigenetic  fount  of  all  knowledge  gathering  activities. 
By  this  1 mean  that  all  concepts  ultimately  have  their  developmental  origins  in 
the  process  of  perceptual  abstraction  of  invariant  information  from  events-what 
J.  J.  Gibson  has  railed  the  pickup  of  invariant  information  over  time.  But  since 
this  view  probably  runs  counter  to  the  more  constructivistic  account  of  knowl- 
edge championed  by  most  contributors  to  this  volume,  perhaps  I should  explain 
what  Wilson  and  1,  following  a Gibsonian  line  of  argument,  believe  to  be  the 
nature  of  the  epigenetic  process  by  which  abstract  concepts  are  specified  by 
perceptual  experiences.  Arter  doing  so,  1 will  relate  this  view  to  some  of  those 
discussed  in  this  volume. 

Our  basic  assumption,  what  might  be  called  the  event  perception  hypothesis,  is 
that  all  conceptual  knowledge  ultimately  has  its  origins  in  the  affordance 
structure  of  events.  By  the  affordance  structure  of  events  (Shaw,  McIntyre,  & 
Mace,  1974)  we  mean  the  detectable  invariant  information  determined  by  events 
that  specifies  the  true  properties  of  its  event  source  that  may  contribute  to  an 
animal’s  or  person’s  adaptative  behavior.  How  this  may  be  will  b«  explored  in  a 
moment. 

This  hypothesis  suggests  that  all  concepts,  even  the  most  abstract  ones, 
ultimately  refer  to  events  when  the  concept  of  event  is  broadly  construed.  For 
instance,  the  concepts  of  running,  smiling,  or  eating  may  be  exemplified  by 
events  involving  running,  smiling,  and  eating.  But  what  of  more  abstract  con- 
cepts such  as  love,  justice,  or  truth?  We  also  assume  that  these  concepts 
ultimately  derive  from  the  perceived  affordance  structure  of  events,  namely,  that 
the  concept  of  love  is  derived  from  perceived  instances  of  loving,  justice  from 
perceived  instances  where  justice  has  been  properly  administered,  and  truth  frem 
perceived  instances  where  somethi..g  believed  to  be  the  case  was  in  fact  shown  to 
be  the  case.  This  is  not  to  argue,  however,  that  each  concept  understood  by  a 
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particular  individual  must  be  learned  by  that  individual  from  experience  with 
actual  events;  for  surely  we  often  leam  vicariously  from  verbal  descriptions  or 
surrogate  forms  (movies,  pictures,  etc.),  of  other  people’s  experience,  as  well  as 
from  descriptions  or  representations  of  fictitious  or  inferred  entities  (e.g., 
stories,  plays,  models,  or  proofs). 

In  short,  much  of  our  conceptual  knowledge  of  the  world  is  socially  derived 
from  instruction  by  others  or  instructional  materials.  A word  of  caution.  The 
event  perception  hypothesis  should  not  be  confused  with  the  view  of  radical 
empiricism  which  asserts  that  perception  is  based  on  elementaristic  structures 
such  as  sense  impressions,  sensations  or  simple  sense  data.  Under  this  view 
abstract  concepts  are  built  up  by  a process  of  association.  By  contrast,  the  view 
we  are  offering  does  not  assume  a single  simple  level  of  analysis  is  sufficient  to 
characterize  the  relevant  information  liberated  by  events,  nor  does  it  assume  an 
associative  mechanism  for  compounding  complex  concepts  out  of  simpler  ones. 
The  cognitive  capacity  we  believe  to  be  at  work  is  generative,  rather  than 
associative,  possessing  considerably  more  structure  (probably  resembling  that  of 
a mathematical  group)  than  that  offered  by  associative  principles.  Indeed,  the 
only  premise  our  generative  theory  of  event  conceptualization  has  in  common 
with  radical  empiricism  is  affirmation  of  the  postulate  of  critical  realism, 
namely,  that  there  exists  a real  objective  world  about  whidi  we  may  have 
veridical  knowledge.  In  addition,  our  theory  of  the  knowing  agent  is  an  active 
one  where  that  of  the  empiricist’s  is  quite  passive.  Now  let  me  relate  our 
generative  theory  of  conceptual  knowledge  to  some  of  the  other  views  presented 
in  this  volume. 

Norman,  Centner,  and  Stevens  (Chapter  9,  this  volume)  suggest  a network 
description  of  the  concepts,  not  necessarily  linguistic,  involved  in  money  chang- 
ing, say  as  in  the  case  of  making  purchases  in  a grocery  store.  Here  objects, 
bearing  specific  relations  to  one  another,  are  being  transformed;  that  is,  money 
and  gcod:  are  being  exchanged  in  accordance  with  certain  economic  principles 
and  customs  governing  behavior  in  the  market  place.  Children,  according  to 
Norman,  eventually  leam  this  process  by  experiencing  those  events  in  which 
grownups  exchange  money  for  goods  (see  Centner,  1975).  He  begins  his 
network  analysis  of  this  knowledge  by  assuming  that  certain  nodes  and  relations 
exist  among  them,  but  his  model  does  not  yet  account  for  how  such  primitive 
nodes  'iid  relations  are  derived  from  experience.  The  generative  model  offered 
by  Wilson  and  me  seems  to  supply  the  missing  cognitive  component  by  which 
such  nodes  and  relations  are  originally  specified  by  events. 

Such  a generative  perceptual  component  is  no  less  necessary  to  round  out 
Greeno’s  (Chapter  7,  this  volume)  network  approach.  Moreover , Hyman’s  (Chap- 
ter 8,  this  volume)  impression  format;. in  task  seems  to  me  to  imply  the  existence 
of  such  a generative  cognitive  capacity  as  does  our  own  experimentation. 
Moreover,  it  seems  to  me  that  the  monster-globe-passing  problem  addressed  by 
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the  Simon-Hayes  (Chapter  14,  this  volume)  problem-solving  model  with  the 
“change’  operator  can  be  construed  as  an  attempt  to  study  the  affordance 
structure  of  a “monster-globe-passing”  event. 


KNOWLEDGE  ORIGINS  VERSUS 
KNOWLEDGE  REPRESENTATIONS 

Before  closing,  1 would  like  to  emphasize  the  distinction  between  the  representa- 
tion of  knowledge  and  the  origin  of  knowledge.  The  two  are  not  the  same.  The 
origin  or  epigenesis  of  knowledge  has  to  do  with  the  primitives  that  go  into  the 
representation,  the  nodes  and  relations,  the  schemata.  My  suggestion  is  that  the 
principles  revealed  by  the  study  of  how  event  concepts  arise  from  the  perceptual 
information  specifying  the  events  may  provide  an  inroad  into  the  fundamental 
problem  of  how  knowledge  should  be  represented.  For  if  we  attempt  to 
represent  knowledge  in  terms  of  networks  or  other  models  by  merely  selecting  a 
priori  labels  for  our  concept  nodes  and  relations,  then  we  may  end  up  with  a 
logically  possible  but  arbitrary  schematic  representation  of  knowledge  that  has 
little  to  do  with  how  the  concepts  actually  arose.  Our  knowledge  of  the  world  is 
interfaced  by  direct  perceptual  experience  of  events  and,  therefore,  the  percep- 
tual processes  involved  can  only  be  ignored  at  our  peril.  For  this  reason  the  role 
of  perception  seems  to  me  to  have  been  greatly  underemphasized  at  this 
conference.  There  seems  to  be  a tend  ncy  toward  too  much  theoretical  depen- 
dence on  secondary  processes  such  as  language. 

Let  us  recognize  the  fact  that  much  of  our  knowledge  is  either  tacit  or  purely 
nonverbal,  neither  being  able  to  a:vjme  a linguistic  form.  For  instance,  try  to 
give  an  adequate  verbal  description  of  nearly  any  object  (e.g.,  the  face  of  a friend 
or  relative),  any  act  (e.g.,  tying  your  shoelace,  riding  a bicycle)  or  any  other 
nontrivial  event.  We  are  never  able  to  verbalize  more  than  a small  tip  of  the 
iceberg  of  what  we  can  recognize  or  recall  about  a familiar  object,  act  or  event. 
Instruction,  therefore,  should  be  aimed  at  this  nonverbal  aspect  of  knowledge 
gathering  activities  as  well.  Learning  to  read  is  of  course  an  indispensable  social 
skill  but  so  is  learning  to  “see”  in  the  broadest  sense  of  the  term. 


AN  APPLICATION  TO  ART  INSTRUCTION 

As  argued  earlier,  this  view  that  perception  is  itseif  a mode  of  cognition  is  in 
no  way  antagonistic  to  the  purposes  of  this  conference,  nor  is  it  as  irrelevant  as  it 
may  at  first  seem.  Let  me  describe  a project,  on  which  I have  been  a consultant, 
that  uses  perceptual  principles  as  an  integral  part  of  its  instructional  design. 

In  1973  the  Minneapolis  Institute  of  Art  requested  help  from  the  Center  for 
Rerearch  in  Human  Learning  at  Minnesota.  P.  Salapatek,  A.  Jonas,  H.  Pick,  and 
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I,  as  perceptual  psychologists,  were  asked  to  help  in  the  development  of  an  art 
exhibit  for  children  to  celebrate  the  opening  of  the  new  art  educational  center 
attached  to  the  museum.  Our  task,  in  cooperation  with  artists  and  art  historians, 
was  to  develop  a sequence  of  modular  displays  aimed  at  the  explanation  of  how 
paintings  incorporate  perceptual  information  that  specifies  spatial  layout,  shape 
of  objects,  color  contrast,  and  so  forth.  Consequently,  we  have  developed  a set 
of  portable  modular  displays  in  which  each  is  designed  to  present  schematic  but 
working  models  of  situations  demonstrating  various  salient  forms  of  perceptual 
information.  For  instance,  one  display  allows  children  to  see  what  happens  to 
the  apparent  size  of  an  objeci  when  moved  across  a texture  gradient  toward  a 
simulated  horizon  of  vanishing  points.  By  this  display  we  hope  to  let  the 
children  learn  about  the  laws  of  linear  perspective  through  dynamic  manipula- 
tions which  provide  exemplars  for  the  relevant  concents. 

Next  to  the  display  is  a large  print  of  a painting  in  the  museum  that  clearly 
utilizes  the  same  perceptual  principles.  In  addition,  between  the  picture  and  the 
display  is  a schematic  drawing  presenting  in  outline  just  those  portions  of  the 
painting  relevant  to  the  principles  demonstrated  in  the  display  itself.  In  other 
words,  the  relationship  between  the  painting  and  the  display  it  clarified  by  a 
simpler,  intermediate  step  that  omits  confusing  or  irrelevant  detail.  In  another 
display  module  we  illustrate  in  a similar  fashion  how  shading  on  objects  or  their 
cast  shadows  provide  information  for  their  shape  or  distance  from  other  sur- 
faces, respectively.  Again  the  abstract  principles  are  illustrated  in  a high-fidelity 
copy  of  an  actual  painting  in  the  museum  interfaced  conceptually  to  the  visual 
display  by  a schematic  diagram  outlining  the  relevant  parts  of  the  picture. 

After  studying  the  dozen  or  so  visual  displays,  the  children  are  guided  through 
the  museum  proper  to  rooms  where  the  pictures  used  in  the  displays  are  actually 
hung.  Our  ultimate  goal  is  to  determine  the  time  that  the  childien  spend  looking 
at  the  pictures  represented  in  the  displays  and,  more  importantly,  the  time  spent 
looking  at  other  pictures  not  seen  before  that  clearly  portray  the  same  type  of 
perceptual  information  presented  in  the  display  modules.  We  hope  that  such 
measures  indicate  that  children  show  more  interest  in  those  paintings  expressing 
the  same  information  studied  in  the  displays  than  in  those  comparable  paintings 
not  relevant  to  principles  demonstrated  in  the  visual  displays. 

In  summary,  it  is  our  intention  to  interest  children  in  the  perceptual  problems 
solved  by  great  painters  in  their  works,  to  provide  them  with  a kind  of  problem 
solving  orientation  to  the  museum  rather  than  to  just  turn  them  loose  in  the 
galleries  to  be  arbitrarily  bombarded  by  a confusing  welter  of  information.  By 
giving  them  the  knowledge  of  the  appropriate  perceptual  concepts  beforehand,  it 
is  our  belief  that  they  will  be  encouraged  to  see  the  paintings  rather  than  just 
look  at  them.  It  is  interesting  to  speculate  on  how  a really  well-controlled 
experiment  of  this  type  might  provide  challenging  data  relevant  to  the  evaluation 
of  many  of  the  cognitive  models  presented  in  this  volume. 


Part  IV 

GENERAL  COMMENTS 


You  have  just  spoken  in  praise  of  me,” 
said  Socrates,  “and  now  it  is  my  turn  to 
speak  in  praise  of  my  right-hand  neighbor.  If 
Agathon  sits  next  to  you,  it  will  fall  to  him 
to  speak  in  praise  of  me  all  over  again, 
instead  of  my  speaking  in  praise  of  him.  Let 
it  be  as  I propose,  my  good  friend,  and 
don’t  grudge  the  lad  his  tribute  of  praise 
from  me,  especially  as  I have  strong  desire 
to  eulogize  him 

Agathon  got  up,  intending  to  move  to  the 
place  on  the  other  side  of  Socrates.  But  at 
that  moment  a crowd  of  revellers  came  to 
the  door,  and  finding  it  left  open  by  some- 
body who  had  just  gone  out,  made  their 
way  into  the  dining-room  and  installed 
themselves  there.  There  was  a general  up- 
roar, all  order  was  abolished,  and  deep 
drinking  became  the  rule  [Plato:  The  Sym- 
posium J . 
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In  reacting  to  this  volume,  1 shall  address  three  points:  the  application  of  theory 
to  practice;  the  nature  of  a prescriptive  science  of  instruction  and  its  compo- 
nents; and,  in  the  context  of  these  components,  specific  remarks  based  on  some 
of  the  chapters  that  have  been  presented. 


THE  APPLICATION  OF  THEORY  TO  PRACTICE 

Strong  applications  of  science  to  practice  generrlly  are  characterized  by  some 
linking  body  of  knowledge  ar.d  procedure— some  sort  of  a linking  science 
between  a descriptive  explanatory  science  and  application  by  professionals. 
There  is  rarely  a direct  relationship  between  (k  scriptive  science  and  professional 
work.  Attempts  at  direct  application  without  boil  ding  a linking  science  either 
tend  to  fade  away  or  are  subject  to  sporadic  flashes  of  interest  by  particular 
scientists  who  happen  to  become  interested  in  some  applied  problems  at  the 
moment.  In  either  case,  no  substantial  structure  is  built  up  into  which  rules  or 
hypotheses  for  professional  application  can  be  placed.  In  psychology,  an 
example  is  the  field  of  programmed  instruction  where  professional  societies  were 
established  which  were  uninhabited,  for  the  most  part,  by  people  who  main- 
tained a scientific  interest  in  learning  theory,  and  who  could  nurture  the 
development  efforts  and  new  attempts  in  programmed  instruction.  The  field 
spun  off  by  itself  with  its  own  body  tf  rules  and  declined  in  impact  because  it 
was  not  yet  ready  to  stand  by  itself.  The  danger  is  that  a body  of  practice  is 
picked  up,  is  deintellectualized  in  the  sense  of  becoming  separated  from  the 
theory  that  generated  it,  and  is  carried  out  in  a rote,  by-the-numbers  fashion. 
Some  of  the  chapters  in  this  volume  make  me  wonder  about  such  a danger.  We 
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may  need  to  guard  against  encouraging  potential  practitioners  from  immediately 
establishing  a Society  for  Cognitive  Research  on  the  Advancement  of  Pedagogy 
(SCRAP)1  that  is  not  nurtured  by  the  active  involvement  of  people  like  those 
represented  here.  Establishment  of  SCRAP  should  not  happen  for  some  time  to 
come.  For  the  present,  what  is  required  are  sustained  attempts  at  instruction, 
constantly  working  between  science  and  practical  problems.  This  interaction  by 
someone  who  has  both  aims  in  mind,  who  is  both  a trained  scientist  and  who 
takes  seriously  a practical  problem,  is  the  way  in  which  a linking  science 
generally  is  developed.  Only  after  the  linking  science  evolves  in  some  form  is  it 
possible  for  a professional  field  to  grow  and  for  people  to  work  only  in  the 
linking  science  itself.  The  game  for  us  at  this  time  is  tc  take  an  honest  interest  in 
instructional  problems,  to  continue  our  interest  in  a science  of  cognitive  psychol- 
ogy, and  to  play  between  the  two  until  this  Unking  structure  develops.  Simply 
turning  exciting  findings  over  to  someone  with  an  appUed  bent  or  turning  some 
interesting  procedures  over  to  teachers  to  use  will  not  do  the  trick. 

It  is  of  interest  to  note  that  in  1899,  in  a presidential  address  before  the 
American  Psychological  Association,  John  Dewey  (1900)  expressed  concern 
about  developing  a linking  science  between  psychological  theory  and  practical 
work: 

“Do  we  not  lay  a special  linking  science  everywhere  else  between  the  theory  and 
practical  work?  We  have  engineering  between  physics  and  the  practical  workingmen  in 
the  mills;  we  have  a scientific  medicine  between  the  natural  science  and  the  physician.” 
The  sentences  suggest,  in  an  almost  startling  way,  that  the  real  essence  of  the  problem  is 
found  in  an  organic  connection  between  the  two  extreme  terms-between  the  theorist 
and  the  practical  worker- through  the  medium  of  the  linking  science.  The  decisive 
matter  is  the  extent  to  which  the  ideas  of  the  theorist  actually  project  themselves, 
through  the  kind  offices  of  the  middleman,  into  the  consciousness  of  the  practitioner.  It 
is  the  participation  by  the  practical  man  in  the  theory,  through  the  agency  of  the  linking 
science,  that  determines  at  once  the  effectiveness  of  the  work  done,  and  the  moral 
freedom  and  personal  development  of  the  one  engaged  in  it. 

. . [If  not  the  teacher  is  compelled]  to  resort  to  purely  arbitrary  measures,  to  fall  back 
upon  mere  routine  traditions  of  school  teaching,  or  to  fly  to  the  latest  fad  of  pedagogi- 
cal theorists-the  latest  panacea  peddled  out  in  school  journals  or  teachers’  institutes- 
just  as  the  old  physician  relied  upon  his  magic  formula  [pp.  110-113], 

A linking  science  is  essentially  a science  of  design.  It  is  a prescriptive,  norma- 
tive science,  and  not  the  kind  of  enterprise  that  most  of  the  people  in  this 
volume,  who  have  been  working  in  explanatory  descriptive  science,  have  been  up 
to.  The  characteristics  of  a science  of  design  have  been  carefuUy  discussed  by 
Herbert  Simon  (1969),  and  need  not  get  into  those  details  here.  The  distinction 
between  a prescriptive  science  of  design  and  an  explanatory  descriptive  science  is 
made  clearly  by  Simon,  and  work  in  a prescriptive  science  mode  has  been 
exemplified  in  the  special  limited  case  of  stochastic  models  of  learning  in 


1 We  have  the  editor  to  thank  for  this  acronym. 
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Atkinson’s  Chapter  4.  In  general,  a prescriptive  science  provides  a schema  for  the 
profersional  in  a field  and  for  de, 'elopers  of  applications  who  provide  the 
professional  with  tools,  techniques,  and  instrumentation  for  professional  work. 
Concepts  and  ideas  are  not  directly  dumped  onto  the  teacher  or  the  college  of 
education.  A design  science  leads  to  the  development  of  instrumentation  for  the 
teacher  and  also  for  the  student  insofar  as  the  student  becomes  his  or  her  own 
teacher.  Talk  about  direct  application  by  descriptive  scientists  can  be  a lot  of 
hand  waving  unless  serious  development  of  a prescriptive  linking  science  takes 
place.  It  is  a very  difficult  task  at  the  moment  because  most  cognitive  psychol- 
ogists are  descriptive  scientists  by  training,  and  very  few  of  them  have  the 
temperament  to  work  at  a prescriptive  science  while  they  do  their  “ieal”  science. 


COMPONENTS  OF  A PSYCHOLOGY 
OF  INSTRUCTION 

Consider  now  some  possible  components  of  a prescriptive  science  or  instruc- 
tional psychology,  but  first  let  me  describe  in  very  general  terms  the  kind  of 
individual  cognitive  competence  to  which  these  components  refer 

The  Development  of  Competence 

The  process  of  instruction,  as  distinguished  from  education  in  general  is,  to  a 
large  extent,  concerned  with  the  development  of  competence  in  a learner  and 
with  the  behaviors  and  cognitive  structures  that  differentiate  the  novice  from  the 
competent  performer  in  a particular  subject  matter.  In  attaining  subject-matter 
knowledge  and  skill,  the  learner  proceeds  through  a novitiate  and  then  develops 
relative  expertise;  he  or  she  learns  to  be  a good  reader,  a competent  mathen. 
tician,  a deep  thinker,  a quick  learner,  a creative  person,  an  inquiring  indivi- 
dual, and  so  on.  These  activities  are  learned  according  to  criteria  of  expertise 
established  by  the  school  and  the  community;  more  specifically,  by  subject- 
matter  requirements,  peer-group  expectations,  and  the  general  social  and  profes- 
sional criteria  for  what  determines  lew,  average,  and  high  levels  of  competence. 
The  educational  and  social  community  adjusts  its  expectations  to  the  compe- 
tence level  of  the  learner  so  that  initially  awkward  and  partially  correct  per- 
formances are  acceptable,  whereas  later  they  are  not;  a young  child  or  a novice  is 
frequently  rewarded  for  rather  uninteresting  behavior,  but  as  competence  grows, 
his  performance  is  attended  to  only  if  it  occurs  in  the  presence  of  an  appropriate 
audience  or  in  an  appropriate  context. 

The  gross  changes  that  take  place  as  an  individual  progresses  from  ignorance  to 
increasing  competence  are  of  the  following  kinds: 

1.  Variable,  awkward,  and  crude  performance  changes  to  performance  that  is 
consistent,  relatively  fast,  and  precise.  Unitary  acts  change  into  larger  response 
integrations  and  overall  strategies. 
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2.  The  contexts  of  performance  changes  from  simple  stimulus  patterns  with  a 
great  deal  of  clarity  to  complex  patterns  in  which  relevant  information  must  be 
abstracted  from  a context  of  events  that  are  not  all  relevant. 

3.  Performance  becomes  increasingly  symbolic  and  covert.  The  learner  re- 
sponds increasingly  to  internal  representations  of  an  event,  to  internalized 
standards,  and  to  internalized  strategies  for  thinking  and  problem  solving. 

4.  The  behavior  of  the  competent  individual  becomes  increasingly  self- 
sustaining  in  terms  of  his  skillful  employment  of  the  rules  when  they  are 
applicable  and  his  subtle  bending  of  the  rules  in  appropriate  situations.  Increas- 
ing reliance  is  placed  on  one’s  own  ability  to  generate  the  events  by  which  one 
learns  and  the  criteria  by  which  one’s  performance  is  judged  and  valued. 

It  is  the  understanding  and  facilitation  of  this  process  of  change  from 
ignorance  to  competence,  from  novice  to  expert,  that  is  a major  focus  of  the 
framework  that  I shall  now  describe. 

Description  and  Analysis  of  Competent  Performance 

First,  description  and  analysis  of  competent  performance— the  state  of  knowl- 
edge and  skill  to  be  achieved.  Since  attention  to  the  instructional  problem  means 
facing  the  necessity  of  studying  behavior  in  tasks  considerably  more  complex 
than  those  typically  studied  in  the  psychological  laboratory,  the  development  of 
new  ways  of  analyzing  tasks  and  specifying  the  content  of  learning  is  required. 
Tasks  in  the  laboratory  have  been  selected,  for  the  most  part,  according  to  what 
is  convenient  and  manageable  for  experimental  and  theoretical  analysis.  Concen- 
tration on  tasks  artificially  constructed  for  experimental  purposes  has  meant 
that  few  psychologists  have,  until  recently,  confronted  the  problem  of  analyzing 
complex  tasks  in  terms  that  allow  access  to  psychological  theory  and  data,  and 
yet  still  preserve  some  Fidelity  to  the  real-life  character  of  the  tasks  themselves. 

For  the  psychologist  concerned  with  instructional  processes,  however,  the 
problem  of  task  analysis  is  a central  one.  Analytic  description  of  what  is  to  be 
learned  facilitates  instruction  by  attempting  to  define  clearly  what  it  is  that  an 
expert  in  a subject-matter  domain  has  learned;  for  example,  what  it  is  that 
distinguishes  a skilled  from  an  unskilled  reader.  When  this  analysis  identifies 
classes  of  behaviors  whose  properties  as  learning  tasks  are  known  or  can  be 
systematically  studied,  then  inferences  concerning  optimal  instructional  pro- 
cesses can  be  formulated  and  tested. 

Procedures  have  been  developing  for  the  analysis  of  tasks-analysis  of  the 
content  of  what  is  learned  and  the  properties  and  processes  involved  in  c<  ipe- 
tent  performance— and  task  analysis  is  characterized  by  the  description  o.' 
performance  in  terms  of  the  demands  placed  on  basic  psychological  processes 
such  as  attention,  perception,  and  linguistic  processing,  and  on  knowledge  and 
skills  assumed  to  be  in  the  learner’s  repertoire.  Further,  since  an  individual’s 
capacities  change  over  time,  task  analysis  reflects  current  knowledge  and  assump- 
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lions  concerning  the  ptoceues  Kibble  at  different  stages  of  learning  o.  develop- 
ment  (see  Rcsnick,  Chapter  3;  KlabrA  Wallace,  1976). 

As  this  volume  reflects,  a mapr  activity  in  this  regard  ' *ing  carried  out 
through  modeling  and  simulating  (computer  simulation  or 
that  can  reproduce  complex  performances  like  playing  chess,  solving  algebra 
word  problems,  series  completion  problems,  certain  problemsolving  tkdh. 
understanding  complex  instructions.  The  processes  chosen  represent  r combine 
tion  of  what  is  observed  in  the  perform*:  subject  and  of  theory  concenOng  the 
characteristics  of  the  conceptual  processes,  memory  processes ^ involved .the 
semantic  information  structure  that  are  buflt  up.  The  simulation  of  complex 
performance  represents  a formal  technique  for  establishing  the  logical  sufficien- 
S and  necessity  of  certain  processes  and  the  way  they  are  assembled  ,o 
producing  particular  performances.  However,  for  the  purposes  of  instructs 

TTxt  mows  required  in  cognitive  simulation  work  have  not  been 
made]'  and  we  need  to  investigate  further,  ft  is  possible  that  ^ -formation 
obtained  from  this  kind  of  analysis  of  human  competence  can  allow  us  to  do 
several  things  with  regard  to  the  optimizing  of  instruction: 

1 Specification  of  the  processes  by  which  highly  competent  individuals  might 
b<*  performing  a task  puts  us  in  a position  to  try  to  teach  these  processes  ^ 
individual  lexers;  for  example,  knowing  that  a large  “perceptual  vocabulary 
of  the  configuration  of  pieces  on  a chess  board  is  a characteristic  of  an  efficient 

l^pCLgbt  Jnr  us .0 a««mp.  <"  <«*  •“*  m 

mot,  efficiently  it.  on,  way  tathe,  ft. 

another  and  Lowing  It.  procedute.  involved  in  th,  mom  .Item  «W  P“ 

US  in  a position  to  design  instruction  so  that  it  approximates  the  most  ef.icient 

inethod. 

The  very  interesting  question  at  the  moment  is  the  teachability  of  the 
processes  that  are  identified  through  simulation.  If  we  teach  these  proasses, 
then  is  acquisition  of  the  performance  influenced,  and  will  “ 

obtain  competence  than  have  in  the  past?  Or,  is  it  likely  tha  we  tea  these 

processes  X will  put  the  learner  in  the  position  of  the  centip  dyred 

L processes  by  which  he  moved  his  hundred  legs,  and  beca  £ 

walking  at  all?  At  any  rate,  hi  my  own  laboratory  (Holzman,  1 7 ) 
currently  invesUgating  the  teachability  of  processes  that  have  been  derived  by 
procedures  of  cognitive  simulation.  As  a sta  rt,  the  particular  task  we  I *»“n 
is  the  series  completion  problem  as  analyzed  by  Simon  and  Kotovsky  (1963). 

A look  at  the  Current  literature  in  experimental  psychology  shows  an  increas- 
ing  number  of  studies  devoted  to  the  analysis  of  complex  cogmUve  perforrnance 
m coottast  to  the  study  of  learning.  WW1,  Ms  trend  ts f“""S  ^ 
description  of  competence  in  inteUectual  tasks,  many  of  the  studies  can  be 
faulted,  from  the  present  point  of  view,  for  their  lack  of  explicit  instructional 
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orientation.  The  explicit  requirements  necessary  for  the  analysis  of  tasks  relevant 
to  the  development  of  an  instructional  psychology  are  the  following:  (1)  that 
complex  real-life  tasks  be  investigated  and  not  tasks  designed  for  laboratory 
convenience;  (2)  that  these  realistic  tasks  be  analyzed  in  terms  of  current 
theoretical  concepts  in  cognitive  psychology;  (3)  that  the  processes  identified  be 
considered  with  respect  to  their  instructability  and  the  conditions  that  influence 
their  acquisition;  and  (4)  that  they  be  related  to  the  cognitive  capacities  or  the 
developmental  status  of  the  child  and  to  the  background  capabilities  of  adults. 

' .i 

Description  and  Diagnosis  of  the  Initial  State  ^ 

The  second  component  of  instructional  design  is  description  and  diagnosis  of  the 
initial  state  with  which  an  individual  begins  a course  of  learning.  There  is  an  |£. 
“immediate"’  and  a “long-term”  approach  to  this  aspect.  The  immediate  ^ 
approach  is  to  take  seriously  the  fact  that  effective  instruction  requires  careful 
assessment  of  the  initial  state  of  the  learner.  For  individual  learners,  we  need  to 
know  their  strengths,  weaknesses,  styles,  and  background  interests  and  talents. 

What  are  the  details  of  what  a child  knows  and  does  not  know  at  particular 
points  in  his  or  her  learning?  What  are  the  details  of  the  skills  that  he  or  she  is 
developing?  What  needs  to  be  improved?  What  strengths  can  be  capitalized  on? 

What  do  various  developmental  levels  and  various  cultural  backgrounds  mean  for 
what  should  be  taught  and  how  it  should  be  taught?  Educational  practices  need 
to  be  designed  so  that  answers  to  these  kin  s of  questions  are  possible  for  all 
individuals  attending  school.  Teachers  and  learners  need  to  be  in  a position  to  f 
obtain  and  utilize  this  information;  with  it,  teachers  can  prescribe  the  instruction 
required  and  students  can  assess  their  own  abilities  and  select  appropriate 
instruction. 

Tht  development  and  use  of  procedures  for  providing  this  information  is 
necessary,  but  I do  not  mean  at  all  to  imply  that  learning  and  schooling  should 
be  one  big  series  of  formal  tests  and  assessments.  1 would  suggest  informal 
observation  as  well  as  infrequent  formal  assessment,  and  adoption  of  an  attitude 
that  the  information  obtained  is  information  for  improving  instruction  and  not 
for  constant  evaluation.  My  colleagues  and  I,  in  our  own  work  in  schools,  have 
found  it  desirable  to  provide  primary-grade  teachers  with  hierarchies  of  increas- 
ing competence  in  various  school  subjects.  These  take  the  form  of  “structured 
maps”  into  which  a teacher  can  place  a child  and  thereby  direct  attention  to 
prerequisite  skills  that  might  need  to  be  leumed  or  advanced  skills  that  the  child 
might  explore.  The  hierarchical  map  is  only  a guide  upon  which  both  the  teacher 
and  the  child  can  impose  their  own  judgment.  Procedures  for  assessing  the 
current  competence  and  talents  of  the  learner  in  a way  that  provides  a basis  for 
instruction  are  generally  not  available  in  current  educational  methods  at  a level 
of  detail  necessary  for  the  effective  guidance  of  individual  learners.  1 call  the 
implementation  of  these  procedures  the  “immediate”  approach  because,  to  a 
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large  extent,  it  is  secondarily  a matter  of  research  and  primarily  a matter  of 
administrative  change  and  the  design  of  appropriate  materials. 

The  long-term  approach  relates  to  the  fact  that  while  a prevalent  method  for 
assessing  initial  state  is  the  assessment  of  aptitudes  that  correlate,  to  some 
extent,  with  end-of-school  achievement,  aptitudes  so  assessed  do  not  provide 
information  about  instructional  processes  (see  Glaser,  1972;  Hunt,  Frost,  & 
Lunneborg,  1973).  They  are  measures  used  for  purposes  of  selection  and  do  not 
provide  a basis  for  deciding  upon  instructional  alternatives.  Aptitude  informa- 
tion does  not  tell  how  an  individual  should  be  instructed  to  improve  his 
performance,  nor  how  instruction  might  be  designed  to  make  the  attainment  o 
successful  performance  more  probable.  The  significant  research  requirement  in 
this  regard  is  to  describe  the  initial  state  of  the  learner  in  terms  of  processes 
involved  in  achieving  competent  performance.  This  would  then  allow  us  to 
influence  learning  in  two  ways-to  design  instructional  alternative:  tluit  adapt  to 
these  processes,  and  to  attempt  to  improve  an  individual’s  competence  in  these 
processes  so  that  he  or  she  is  more  likely  to  profit  from  the  instructional 
procedures  available. 

Conditions  that  Foster  Learning 
and  the  Acquisition  of  Competence 

The  third  component  of  instructional  design  comprises  the  conditions  that  foster 
learning  and  the  acquisition  of  competence-.'ssentially , the  procedures  by  which 
one  learns  and  the  nature  of  the  environment  in  which  learning  occurs.  There  are 
at  least  two  directions  in  this  regard  that  need  to  be  taken  for  the  development 
of  an  instructional  psychology. 

The  first to  recognize  that  we  do  know  a little  about  learning.  For  example, 
we  know  some  things  about  the  effects  of  reinforcement-the  contingencies 
consequent  to  performance;  about  the  conditions  under  which  discriminations, 
generalization,  and  concept  formation  take  place;  and  about  conditions  of 
practice,  interference  with  memory,  the  nature  of  attention,  the  effects  of 
punishment,  and  how  observational  learning  and  modeling  can  influence  new 
learning.  We  know  these  things  in  terms  of  descriptive  science,  but  little  investi- 
gation has  been  made  from  the  point  of  view  required  for  the  utilization  of  this 
information  for  designing  the  conditions  of  instruction.  Exceptions  to  this  are 
the  work  on  behavior  modification  and  the  work  on  optimization  models 
described  in  the  chapter  by  Atkinson.  However,  neither  of  these  enterprises  has 
considered  complex  cognitive  performance  in  any  systematic  way. 

What  k required  is  research  on  what  we  know  about  learning  cast  into  the 
mold  ot  a design  science  which  attempts  to  maximize  the  outcomes  of  learning 
for  different  individuals.  A new  form  of  experimentation  would  be  called  for 
where  the  tactic  is  not  to  develop  models  of  learning  and  performance,  but  to 
test  existing  models  by  using  them  for  maximizing  the  effects  of  learning  under 
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various  conditions.  For  this  purpose,  we  need  a theory  of  the  acquisition  of 
competence.  Such  a theory  would  be  concerned  with  how  an  individual  acquires 
increasingly  complex  performances  by  assembling  the  present  components  of  his 
repertoire,  manipulating  the  surrounding  conditions  and  events,  and  employing 
his  knowledge  of  how  he  learns. 

Effects  of  Instructional  Implementation 
in  the  Short  and  in  the  Long  Run 

The  fourth  component  of  instructional  design  is  concerned  with  the  effects  of 
instructional  implementation  in  the  short  and  in  the  Ion1}  run-effects  that  occur 
immediately  in  the  context  of  instruction  and  supply  relatively  immediate 
feedback,  and  effects  that  persist  in  terms  of  long-term  transfer,  generalized 
patterns  of  behavior,  ability  for  further  learning,  and  so  on.  One  requirement  for 
this  purpose  is  to  break  away  from  the  tradition  of  norm-referenced  measure- 
ment to  measurement  more  concerned  with  identifying  the  nature  of  criterion 
performance  (see  Glaser,  1963;  Glaser  & Nitko,  1971).  For  effective  instruc- 
tional design,  tests  will  have  to  be  criterion  referenced  in  addition  to  being  norm 
referenced.  They  will  have  to  assess  performance  attainments  and  capabilities 
that  can  be  matched  to  available  educational  options  in  more  detailed  ways  than 
can  be  carried  out  with  currently  used  testing  and  assessment  procedures.  This 
will  be  an  important  part  of  the  development  of  a psychology  of  instruction.  It 
is  mandatory  that  testing  not  stand  out  as  an  extrinsic  and  external  adjunct  of 
instruction.  Tests  need  to  be  interpreted  in  terms  of  performance  criteria  so  that 
the  learner  and  the  teacher  are  informed  about  an  individual’s  progress  relative 
to  developing  competence.  In  this  way,  information  is  provided  for  deciding 
upon  appropriate  courses  of  action  for  assisting  instruction. 

The  processes  measured  by  tests  designed  to  fadlatate  instruction  need  to  be 
related  to  processes  identified  as  components  of  competence.  For  this  purpose, 
some  interesting  endeavors  can  be  envisioned.  A good  example  in  this  volume  is 
the  work  going  on  in  analyzing  the  processes  involved  in  the  comprehension  of 
written  language.  Stimulated  by  the  work  in  psycholinguistics  and  cognitive 
psychology,  there  is  a great  deal  of  excitement  about  the  nature  of  language 
comprehension  processes  at  this  time.  This  excitement  should  be  juxtaposed 
with  the  fact  that  there  has,  over  years,  been  a great  deal  of  work  on  the 
development  of  tests  of  reading  comprehension.  The  new  development  is  that  as 
we  begin  to  analyze  comprehension  tasks  and  relate  them  to  theories  of  semantic 
memory,  imagery,  and  so  forth,  we  can  begin  to  develop  tests  that  provide  us 
with  diagnostic  information  about  component  processes  that  contribute  to 
performance  and  that  can  be  influenced  through  instruction.  This  kind  of 
activity  should  change  the  nature  of  assessment  procedures  and  provide  the  kind 
of  information  required  for  maximizing  instructional  outcomes. 
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ADDITIONAL  COMMENTS5 


I would  like  now  to  comment  on  specific  chapters  in  this  volume  in  relation  *o 
the  components  that  I have  just  described. 

Analysis  of  Competence 

vriSTST;  **  "**7  C°mpetent  emprises  most  of  this 

° fthC  “mp°"ent!  that  1 listed,  this  volume  rates  high  on  the  explica- 
hon  of  performance  theory  and  the  analysis  of  states  of  competence  Several 

st^T  ^ ib°T*tei  dcsciiption  of  competence  in  terns  of  cognitive 
racture,  sequential  task  analysis  to  show  changes  brought  about  by  insertion 

^o„.  Gmeno  »d  Non™  .treLd  cn^£ 
jectives,  they  would  like  to  display  a student’s  cognitive  structure  and  state 

ssissr.ir  r-  ~ " <*.  “ir 

lTrm  7!  n- H f 7S'  bUt  they  leaVe  US  "*re  technological  prob- 
T br°UBht  “P  thlS  P™1  When  he  «*«»  how  one  teaches  toward  Shese 

,rr  gt° impUe’  -*■  * 

ture  can  be  given  directly  to  teachers  and  students  in  order  to  facilitate  th- 
«cqi^tK,n  of  comp,,^.  „ „ hgrd  t0  ^ how  ^ h doM 

,ub«,  m 1 If  P,0're“,t'  incr'“"*ly  complex  theories  of 
ff'  tot  ssslst  learning  and  am  approximation,  to  the  kinda  of  theory 

f'rrrr  "rpioy’'  T"ch,n*  p™*dums  hm  ™ 

ZT, T ? T - ~ndThi,t.dT^ 

cated  cod  a shorn  of  a body  of  knowledge.  I,  i,  p„,hble  tha“ 

m,„eTnkf„drofhr;^:b’K,™'"r  wi'd8' miehi  — -»■<«-► 

tin*  “*  °f  «*"*««  foe  ins, me- 

l Z f d p °n’  8 Problem  also  raised  by  Carroll.  How  does  one 

o^t  StrUCtUreS?  ^ **  described  “I  the  same  term,  tottaSS 
of  cognitive  processing  and  cognitive  theories  are  written  in?  The-e  is  no  answer 
to  this  question  forthcoming  from  this  volume.  It  may  be  useful  however  to 
point  out  that  when  Atkin*>n  built  his  instructional  programs  in  reaZ^d 

XTrLZT’,  : *?“  WitH  SU,ldaid  «U^~.  He  Jketftt 
ubject  matter  and  broke  it  up  into  what  seemed  to  be  reasonable  units  as  anv 

instructor  would  do.  On  these  units  of  analysis,  he  imposed  his  tetrJ- d 
th^o^oTnretr!^  7*™  7*  *"  "wnl  referen«‘  >°  the  material  introduced  a, 

Stas  sa~  - 
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technology.  In  the  case  of  learning  foreign  language  vocabulary,  he  analyzed  the 
subject  matter  in  more  cognitive  terms.  This  was  part  of  his  implicit  description 
of  the  building  up  of  — .iory  structures  that  underly  the  competence  he  sough 
to  teacn.  As  a general  answer  to  the  level-of-analysU  problem,  it  seems  that  there 
should  be  a difference  between  science  and  practice.  A petroleimj  chemis  c 
think  about  atoms  and  forces  in  molecular  ctructures,  but  he  also  can  think 
about  gross  quantities  of  gases  and  residues  in  the  fractionating  tower.  H.s 
theory  can  work  back  and  forth  between  the  two  systems  of  description.  The 
“linking  science”  I mentioned  earlier  should  enable  this  to  occur  if  at  all 

T^optiriization  procedures  such  *s  those  carried  out  fay  Atkinson,  “j* 
analysis  proceeds  as  learning  occurs.  Continuous  assessment  is  made  of  the 
difficulty* and  structure  of  the  task.  This  technique  of  ongoing  task  analysis 
might  be  a useful  way  to  study  the  acquisition  of  competence.  For  example,  in 
the  BASIC  Instructional  Program,  the  program  stores  the  student  s evaluatim  of 
his  own  skills  and  compares  his  solution  attempts  to  stored  models.  In  this  way, 
information  can  be  obtained  about  the  changing  structure  of  developing  compe- 
tence that  might  not  be  uncovered  by  more  static  techniques. 

A recurrent  theme  of  the  volt  me  is  that  a target  of  instruction  is  competence 
in  process,  for  example,  the  constituent  comparison  exercises  to  improve  reading 
comprehension  suggested  by  Just  and  Carpenter.  Shaw’s  generative  system  might 
be  thought  of  asaspedal  process  skill  that  is  a teachable  entity,  and  not  ordy  a 
theoretical  description  of  what  occurs.  Simon  and  Hayes  suggest  giving  students 
a wide  repertoire  of  problem-solving  styles  and  detection  cues  for  matching 

styles  to  situations. 

Diagnosis  of  Initial  State 

In  contrast  to  the  analysis  of  attained  competence,  this  volume  had  less  to  say 
about  this  second  component,  the  diagnosis  of  the  initial  state.  There  were 
several  themes:  notions  about  some  kind  of  psychometrics  of  process  for  initi 
state  assessment,  consideration  of  developmental  growth  changes,  and  the  notion 
of  schema  discrepancy  between  initial  state  and  competence  state  Calfee  in 
Chapter  2 emphasized  techniques  for  the  assessment  and  diagnosis  of  processes 
in  initial  state  competence.  Wallace  referred  to  "process  profiles  prior  to 
instruction  and  the  design  of  custom-built  training  as  a function  of  these 
profiles.  Siegler  talked  oi  the  developmental  aspects  of  the  facihty  of  Id 
children  to  use  analogy  information  as  compared  to  younger  children,  and  he 
brought  to  our  attention  the  necessity  for  taking  developmental  growth  phe- 
nomena into  account  in  assessing  initial  states.  Hyman’s  notion  of  discrepancy 
from  schema,  the  extent  to  which  old  information  influences  new  information, 
strongly  suggests  initial  competence  analysis  prior  to  a course  of  instruction. 


17.  INSTRUCTIONAL  DESIGN 


3?  3 


Acquisition 

The  third  component  involves  . theory  of  acquisition^ 

an  initial  state  to  competent  performance.  This  component  was  notexpbatly 

addressed  in  most  chapters.  There  was  some  concern  about 

performance  that  are  involved  when  someone  proceeds  from  novice  to  exjwrt. 

*e  growth  of  the  cognitive  net,  increasing  symbiotic  manipulation,  ocmcept 

genemtiori  from  spaX  even..,  *«*».> Z 
external  cues.  bigger  ch  uilriug  underling  petfomt®-.  “d  “™' 
teristics  of  the  changes  in  competence  as  one  moves  from  novcc  toexper^e 
primary  data  required  for  a theory  of  acquisition,  and  earlier,  I described  som 
Lss  features  of  these  changes.  Wallace  discussed  changes  m the  acqiusthon  of 
Competence  when  he  described  the  shift  in  “artentional  p»n 
attention  to  dimensional  attention.  Resnick  presented  her  work  with  Gr*nthat 
described  the  change  in  children  from  one  method  of  addition 

'TClenerL  however  while  the  contributors  had  some  interesting  ideas  about 
pr^o^deXmg  competence,  when  they  talked  about  teaching^ 
Locesses  they  fell  back  on  standard  “behavioristic”  training  procedures.  Wallace 
mentioned  “failure  theory”  by  which  he  meant  that  he  obtained  in  ormation  on 

Z^r.c“de,  id  L W4*  .0  the-  Tb.p.ocxd^b,  ~d 

a fonn  of  the  succeswve  approximations  procedure  popu^nnd  by  sk™  ' 
Hayes  talked  about  teaching  problem  solving  and  employed  good  common^ 
"Sf procedures.  In  talking  about  penem.o,  vm.  Shaw  71*“““  'h 
imnortance  of  good  exemplars  in  the  development  of  concepts,  which  is  a 
familiar  tedmiqSe  deeeloped  in  behavioral  studie,  of  concept  formanm:  this  it 
SCS  "»>•“  one  gets  in  sdiool jnty 

not  be  verv  well  designed  to  provide  rich  exemplars  of  concepts  a.-d  Sha 
emphasized' this  .pin  a.  a deeper  process  level  Collin.  $£ 

used  Socratic  dialogue  as  his  insBuctional  procedure.  Norman, 
of  knowledge  stressed  the  importance  of  continuous  modification  of  a know! 

schen^’Lid  then  when  pressed  about  teaching  technique  he  suited 
practice  with  feedback.  The  point  is,  as  CarToU  indicated,  that  behavioral  ory 
„ successful  because  it  focuses  on  observation  of  the  development  of  behavioral 
components;  cognitive  themy  at  this  Urn.  is  difficult  to  express  m umpk 
observable  terms,  and  it  certainly  needs  such  expression  to  decide  upon  appro- 
priate instructional  transformations.  When  this  is  done,  new  will  teaching  pr 

“S^TnSde  a good  point  in  calling  attention  to  time  spans  in  attaining 
competence.  He  said  that  it  takes  five  to  ten  years  to  become  a good cook,  m 
an  interesting  question  in  going  from  ignorance  to  competence  awhy< bfTeren 
areas  of  knowledge  and  skill  require  longer  and  shorter  tunes  for  co  pe 
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be  attained  Instruction  has  to  be  thought  of  in  such  terms;  learning  to  play  the 

STSr-ta . k-.  **  •*—  i“"in»  •»  * ° » 

*z  rzs  !“*»»■ .*>««» •>*  »r  “mpe,““ by;. 

rxoreriure  of  continuous  decision  making-decisions  for  prescribing  t e con  1 
^“rilg  urar  need  .0 be  presented  as . function  of.be  date oburn.d on 
student  performance.  The  question  that  comes  to  mind  is  whether  the  n 

he  employ,  is  a good  dime, ion  io,  a scene.  -■ 

tioo  The  procedure  requires  mathematical  statement  of  a learning  function,  this 

HoTJSH  Atkinson  indicares,  only  fo,  very  simple  and  quite  Ur- 
LmToftahavtor.  The  question  is  die  following:  If  one  thinks  of  usrng  dies. 
opirniiaatiOT  proced  ires  with  di.  more  complex  theories  of  com.,, ore  structures 
presented  in  Oris  volume,  then  how  are  they  expressed  in  a form  drat  might  be 
Ld  with  the  optimisation  procedures  available?  Even  jf  mathematical  “P"5' 
sion  is  possible,  feasibility  of  computation  may  get  out  of  hand. 
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Effects  of  Implementation 

The  fourth  component  of  a psychology  of  instruction  is  ^ 

effects  of  the  conditions  provided  for  acquisition.  In  this 
point  that  the  kind  of  feedback  Skinner  emphasized  in  operant  con*  *°n  g 
essentially  information  on  the  topography  of  a response.  In  Skmner-hke  pro 
grammed  instruction,  it  appears  that  it  is  the  topography  of  the  response  that  is 
TeToperty  of  perform^.  that  is  being  shaped -successive  appro^auon 
irradSaln»atial,  gating  closer  and  closer  to  terminal  competence;  it  is  the  effect 
of1“tru^on.i  additions  on  this  response  aspect  for  which  information  is 
reouired  to  further  conduct  the  instructional  procedure.  Atkinson  uses  as  feed_ 
back  the  difficulty  of  items  in  a Ust;  he  employs  response  history  and  update  of 
item  difficulty  parameters  to  obtain  measurements  of  the  effects  of  '"s,r"ctl™ 
conditions.  Greeno  and  Norman  talk  about  getting  information  on  the  structure 
of  ^cognitive  map  of  knowledge,  and  as  I said  before,  they  leave  us  wrih  a 
large  lexical  problem.  How  do  we  measure  “it”  to  assess  instruction  ana  to 
provide  the  feedback  required  to  student  and  teacher?  Can  * representauon  be 
nrovided  to  a teacher  or  to  a student,  and  can  they  be  taught  instructional  skills 
bv  which  they  can  change  f,u.  representation?  Fletcher  reminded  us  that  if  we 
uL  a computer  for  storing  models  of  individual  students  to  represent  their  state 
of  learning,  we  would  seriously  tax  computer  memory. 
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I mentioned  the  notion  of  a psychometrics  of  process  measurement,  and  this 
brings  up  a point  about  the  analysis  of  SAT  competence  in  the  work  of  Just  and 
Carpenter.  When  they  analyze  the  SAT-type  examinations,  they  may  be  getting 
something  out  of  their  analysis  that  is  not  what  they  are  looking  for  The  SAT 
tests  are  designed,  in  the  usual  psychometric  way,  to  yield  a spread  of  s res.  A 
dispersion  of  item  difficulty  is  built  into  the  test  in  order  to  maximize  te^i-score 
vananct  so  that  the  correlation  with  criterion  variables  is  maximized.  The  tests 
are  selec  ed  both  for  the  stimulus  situation  they  provide  and  for  certain  psycho- 
metric properties.  When  Just  and  Carpenter  analyze  the  perforrnanc  processes 
involved  in  the  SAT  tests,  they  may  be  analyzing  also  the  properties  of  psycho- 
metric item  difficulty.  The  extra  considerations  built  into  the  requirements  for 
obtaining  correlational  validity  may  introduce  extraneous  processes  that  would 
not  be  present  in  a test  designed  to  measure  criterion  performance  more  directly. 

As  a final  remark,  I must  state  as  strongly  as  possible  that  we  cannot  think  in 
terms  of  standard  classrooms  as  we  begin  to  think  about  the  application  of  the 
concepts  of  this  volume  to  hie  design  of  instruction.  Even  if  we  claim  to  know  a 
good  deal  about  the  processes  involved  in  word  decoding  and  beginning  reading 
the  engineering  problem  of  clacsroom  material  and  environmental  design  is  ever 
present.  If  we  continue  to  tHok  i i terms  of  the  standard  classroom  rather  than 
in  terms  of  how  schooling  can  be  restructured,  then  our  efforts  will  be  seriously 
attenuated  and  kept  down  by  the  weight  of  traditional  educational  modes. 
Finally,  I agree  with  and  must  reemphasize  a general  sentiment  of  this  volume 
that  work  on  instructional  and  educational  problems  will  be  a major  test  of  our 
theories  of  human  cognitive  performance. 
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Implications  for 
Instructional  Research1 

Courtney  B.  Cazden 
Harvard  University 


These  comments  come  from  a background  of  elementary  school  teaching  and 
developmental  psychology.  Recently,  l have  been  concentrating  on  how  children 
learn  (or  acquire  or  develop)  their  native  language,  and  on  the  environmental 
events  which  may  affect  that  learning  in  and  out  of  school  (Brown,  Cazden  & 
Bellugi,  1969;  Cazden,  1972).  I was  an  elementary  school  teacher  in  the  late 
1940s  and  1950s.  Now  I’m  going  back  to  teach  children  for  one  year  in  a 
primary  classroom  in  an  inner-city  public  school. 

In  thinking  auout  these  plans,  I realized  that  the  three  topics  which  Hugh 
Mehan,  director  of  the  teacher  training  program  at  the  University  of  California, 
San  Diego,  and  I proposed  to  investigate  in  my  classroom  can  all  be  reformulated 
in  terms  of  topics  discussed  here.  (A  validation  of  the  sophistication  of  our  plans 
or  the  real-life  relevance  of  these  discussions?)  First,  what  is  a “relevant  curricu- 
lum”? In  Norman’s  sense,  what  do  children  know  that’s  relevant  to  what  we 
want  to  teach— relevant  either  because  it  can  help  or  because  it  may  interfere? 
Second,  how  do  children  understand  or  fail  to  understand  instructions,  primarily 
oral  instructions  because  the  children  will  be  young  and  on  the  edge  of  literacy  I 
mean  particularly  the  kind  of  instructions  that  teachers  seem  to  give  frequently 
(Mehan,  1972)  and  that  are  ill  structured  in  Simon  and  Hayes’  sense  because 
they  put  “demands  on  the  knowledge  and  repertory  of  problem-solving  skills  of 
the  solver”  (page  279)?  Can  we  help  children  cope  with  such  questions  by 
recognizing  their  incompleteness  and  asking  for  more  information?  Third,  in 
what  subtle  ways  do  teacher  expectations  get  produced  on  the  one  hand,  and 

1 Editor’s  note:  In  this  chapter  there  are  several  references  to  material  introduced  at  the 
Workshops  presented  during  the  symposium  by  Bamberger  and  Collins.  For  a brief  summary 
of  these  Workshops,  see  the  Preface  to  this  book.  BamWger’s  work  is  also  described  in 
Bamberger  (1972),  and  Collins’  in  Collins,  Warnock,  Aiel  and  Miller  (1975). 
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have  their  etfects  on  the  other  hand?  As  one  example  of  Hyman’s  concern  with 
how  people  make  sense  out  of  their  world,  how  does  a teacher  in  a very  complex 
environment  make  sense  out  of  her  children?  If I refrain  from  “preparing  my 
mind’  (to  adopt  Hyman’s  phrase)  and  do  not  look  at  the  children’s  cumulative 
records,  can  we  track  how  cues  from  the  children  are  used  in  making  decisions 
before  or  during  instruction,  and  how  the  categorizations  of  the  children  that 
will  inevitably  be  built  up  affect  my  interactions  with  them? 

In  spelling  out  these  reformulations  of  my  plans  for  next  year,  1 don’t  intend 
to  imply  that  tile  deliberations  of  this  conference  should  be  judged  by  their 
usefulness  next  September.  On  the  contrary,  the  more  appropriate  question  is 
about  the  implications  for  research  on  instruction  during  the  next  five  years. 
From  that  perspective,  the  comments  below  fall  under  four  headings:  the 
concept  of  instruction,  cognitive  objectives,  task  analyses,  and  means  of  instruc- 
tion. 


THE  CONCEPT  OF  INSTRUCTION 

1 am  impressed  by  the  broad  concept  of  instruction  explicit  in  some  of  the 
chapters,  and  implicit  throughout  this  volume.  In  its  catchiest  version  (Olson, 
this  volume),  instruction  includes  muddling  by  the  learner  as  well  as  modeling 
and  meddling  by  the  teacher.  In  Resnick’s  (Chapter  3)  more  precise  definition, 
Instruction  means  “any  set  of  environmental  conditions  that  are  deliberately 
arranged  to  foster  increases  in  competence.  Instruction  thus  includes  demon- 
strating, telling,  and  explaining,  but  it  equally  includes  physical  arrangements, 
structure  of  presented  material,  sequences  of  task  demands,  and  responses  to  the 
learner’s  actions  [page  51].”  In  Carroll’s  application  to  language  skills,  instruc- 
tion is  “defined  broadly  as  any  external  influences  on  the  development  of 
language  skills,  as  represented  both  by  formal  teaching  actions  and  by  more 
informal  social  interactions  [page  5] .” 

There  have  undoubtedly  been  many  influences  on  this  broadened  definition. 
For  me,  two  of  the  most  important  have  been  the  writings  of  more  persuasive 
proponents  ot  ‘ open”  or  “learner-controlled”  education,  and  the  research  on 
language  learning  which  Carroll  has  summarized.  There  is  a noncoincidental 
relationship  between  these  two  influences,  because  the  ease  with  which  children 
learn  their  native  language  without  instruction  more  narrowly  conceived  has 
been  used  (e.g.  by  Holt,  1%7;  Morrson,  1964)  as  an  argument  for  more  open 
education.  Resnick’s  definition  extends  this  broadened  concept  to  all  learning, 
not  just  language,  and  to  all  settings,  not  just  the  one  formally  designated  place 
we  call  a classroom  So,  for  example,  the  design  of  museum  displays  mentioned 
by  Shaw  (pages  298-299)  is  included,  and  >ecomes  one  among  many  additional 
settings  for  research. 


i VifTiWfltk’Ai'-V'll'rVr  i"' ~ 


18.  IMPLICATIONS  FOR  INSTRUCTIONAL  RESEARCH  319 


COGNITIVE  OBJECTIVES 


Like  the  concept  of  instruction,  the  concept  of  cognitive  objectives  is  a signifi- 
cant contribution  of  this  conference.  In  all  discussions,  cognitive  objectives 
comprise  both  conceptual  and  procedural  knowledge,  or  “semantic  networks 
and  problem-solving  algorithms'’  (Greeno,  Chapter  7).  But  the  distinction  be- 
tween tnese  cognitive  objectives  and  the  older  formulations  of  behavioral  objec- 
tives needs  to  be  strengthened  further  if  it  is  not  to  collapse  under  the  pressures 
of  evaluation.  Evaluation  requires  judgment  about  some  behavior.  How  can  the 
evaluator  be  sure  that  a particular  behavior  indicates  underlying  knowledge 
rather  than  rote  and  limited  procedure?  In  Wertheimer’s  terms  (Resnick,  pages 
58-59),  how  can  we  tell  the  difference  between  “understanding”  and  “insight” 
vs.  “ugly”  problem  solving  solutions?  In  the  terms  now  used  in  the  language 
domain  (defined  by  Carroll,  Chapter  1),  how  can  we  infer  underlying  compe- 
tence from  any  particular  performance? 


Greeno  ^Chapter  7)  suggests  two  criteria  for  meaningfulness:  “goodness  of 
structure”  and  “solving  in  the  problem  domain”  (pages  147-148).  “Goodness  of 
structure”  is  an  accurate  name  for  the  criterion  used  in  language  development 
research.  When  a child  says  / don’t  want  milk,  we  decide  whether  to  credit  that 
child  with  the  knowledge  of  don’t  as  an  anxiliary  Ly  determining  whether,  in 
this  child’s  language  system,  don’t  is  simply  a single,  unanalysed  chunk,’ an 
alternative  negative  to  no,  or  whether  it  coexists  with  other  auxiliaries,  positive 
as  well  as  negative-e.g.,  can  and  will  and  doesn ’t  as  well  a % don't.  “Goodness  of 
structure”  in  language  thus  means  that  a particular  element  is  a connected  part 
of  a larger  system,  and  is  recombinable  in  multiple  patterns  More  generally,  this 
is  also  Norman’s  concept  of  understanding  (Norman,  Centner,  and  Stevens 
Chapter  9). 

With  regard  to  evaluation,  Greeno  says,  "...  evaluation  must  come  from  the 
potential  users  of  the  product,  not  fro:  , one  who  proposes  to  offer  the  product 

for  use Evaluation  may  be  poss.L-k  when  ve  can  display  a relatively 

complete  analysis  of  the  knowledge  desired  in  some  subject  [p.  158].”  Working 
out  the  details  for  evaluating  cognitive  objectives  in  all  domains  should  become  a 
high  priority  task. 


TASK  ANALYSES 

If  we  accpt,  as  I think  we  should,  Norman’s  description  of  the  learning  process 
as  one  of  communication,  with  all  participants  attempting  to  form  new  mental 
structural  representations  that  will  account  for  the  infonnation  which  they 
experience,  then  two  kinds  of  task  analysis  are  needed.  One,  usually  called  by 
that  name,  is  an  analysis  of  what  the  child  has  to  learn;  the  other,  usually  called 


\ 
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by  some  other  name  such  as  diagnosis,  is  a description  of  what  the 
teacher  has  to  do  as  he  makes  decisions  (or  forms  hypotheses)  about  what  and 
how  to  teach.  The  first  is  discussed  by  Resnick;  the  second  is  discussed  by 
Atkinson  in  his  explanation  of  how  computer-assisted  instruction  can  be  indi- 
vidualized and  optimized,  and  by  Allan  Collins  in  his  demonstration  of  how 
tutors  of  geography  find  out  about  the  preinstructional  knowledge  of  their 
pupils.  The  task  of  diagnosis  is  different  for  computers  and  human  teachers,  not 
only  because  the  teacher  brings  to  the  task  a more  richly  prepared  mind  which 
may  help  or  interfere,  but  also  because  of  the  magnitude  of  the  information 
processing  task.  Of  the  ten  panels  assembled  by  the  National  Institute  of 
Education  in  July,  1974  to  suggest  plans  for  research  on  teaching,  one,  chaired 
by  Lee  Shulman,  considered  teaching  as  clinical  information  processing  in  this 
sense. 

One  of  Resnick’s  four  requirements  for  task  analyses  of  what  children  have  to 
learn  is  that  they  must  “recognize  a distinction  between  early  forms  of  compe- 
tence and  later  ones . . . [and  must]  describe  performance  characteristics  of 
novices  and  attempt  to  discover  or  point  to  key  differences  between  novices  and 
experts,  suggesting  thereby  ways  of  arranging  experiences  that  will  help  novices 
become  experts  [page  53] .”  Certainly  in  language  and  in  the  domains  of 
knowledge  studied  by  Piaget,  and  probably  in  others  as  well,  the  knowledge  of 
novices  is  not  simply  an  incomplete  version  of  the  knowledge  of  the  expert  or 
mature  learner;  it  is  qualitatively  different.  While  some  of  the  differences  may, 
upon  closer  examination  of  the  underlying  process,  be  due  to  the  lack  of  some 
conceptual  or  procedural  knowledge,  other  differences  seem  more  a result  of 
attention  to  different  features  of  an  event  than  is  characteristic  of  adult  minds 
and/or  embodied  in  culturally  transmitted  forms  of  representation. 

In  the  course  of  language  development,  for  example,  children  utter  verbs  like 
goed  and  unpeel,  and  they  answer  the  question  What  are  you  doing?  in  the  form 
/ doing  dancing.  These  immature  forms  car  reasonably  be  considered  as  the  lack 
of  knowledge  of  contextual  restrictions  on  the  generality  of  a particular  pattern. 
And  the  childish  question  Why  I can't  go?  can  reasonably  be  considered  as  the 
lack  of  a particular  operation  that  reverses  pronoun  and  auxiliary.  But  explana- 
tions of  incompleteness  do.  not  so  easily  fit  the  categorizations  of  sounds 
underlying  young  children's  invented  spellings  (Read,  1971),  nor  children’s 
representations  of  musical  tunes  that  Jeanne  Bamburper  described.2 

There  are  also  individual  differences  in  the  course  of  learning,  and  maybe 
culturally -derived  group  differences  as  well.  These  are  only  beginning  to  be 
looked  at  in  research  on  language  learning.  It  seems  to  be  a general  methodologi- 
cal principle  that  the  search  for  universal  patterns  precedes  the  s»arch  for 
variations  on  them. 

I am  not  suggesting  (even  if  I knew  how)  to  buik.  such  qualitatively  different 
novice  stages  into  an  instructional  program  as  intermediate  objectives.  But  we 


’See  footnote  on  page  317. 
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need  to  take  them  into  account,  not  just  to  value  them  as  indicators  of  learning, 
but  also  to  -’periment  with  how  best  to  use  them  as  the  basis  for  continued 
growth.  Jeanne  Bamburger’s  demonstration  of  children’s  learning  musical  struc- 
ture is  a fine  exampie.  As  she  said,  children’s  muddling  must  gdide  teacher’s 
meddling. 

Flavell  (1972)  has  made  an  important  attempt  to  categorize  the  restructuring 
that  takes  place  between  points  in  cognitive-developmental  sequences  as  not  just 
addition,  but  also  substitution,  modification,  inclusion,  and  mediation.  Nor- 
man’s protocols  of  adults  learning  to  cook  and  learning  a computer  programming 
language  show  that  the  knowledge  of  adult  learners,  as  well  as  children,  must 
often  be  modified  and  not  just  added  to. 

In  advocating  this  more  complex  model  of  the  course  of  learning,  I don’t  mean 
to  devalue  examples  of  the  best  we  now  know  how  to  do  in  instructional 
design  -such  as  Resnick’s  arthmetic  hierarchy  and  Atkinson’s  sequences  in  each 
strand  of  his  program  for  beginning  reading.  But  in  the  longer  range,  I hope  that 
Norman’s  model  (more  adequately  described  in  his  text  than  in  his  too  addi- 
tively  drawn  diagrams)  will  guide  future  research. 

In  all  task  analyses,  it  is  essential  to  remember  that  a formal  analysis  of  the 
structure  of  some  knowledge  or  skill  does  not  necessarily,  or  even  probably, 
reflect  the  organization  in  anyone’s  head,  much  less  how  it  got  there.  I’m  not 
sure  whether  Greeno’s  statement  that  “a  theory  specifying  cognitive  structures 
and  processes  sufficient  to  perform  those  tasks  is  a candidate  hypothesis  about 
what  the  instruction  is  trying  to  produce  [p.  124]”  clarifies  or  confuses.  The 
recent  history  of  psycholinguistic  research  reminds  us  of  the  importance  of  this 
distinction,  as  Carroll  and  Just  and  Carpenter  point  out.  And  there  is  no  reason  to 
believe  that  the  story  will  be  different  elsewhere.  If  we  intend  only  effective 
instruction,  then  the  justifying  criterion  of  effectiveness  may  be  sufficient,  as 
Greeno  and  Resnick  suggest.  But  if  a model  of  cognitive  processes  is  sought,  then 
more  thorough  psychological  validation  is  required.  Anyone  engaged  in  such 
endeavors  should  read  Holt’s  satire  of  task  analysis  in  his  cescription  of  how 
children  would  be  taught,  presumably  less  successfully,  how  to  talk  (Holt,  1967, 
pp.  56-57),  if  only  to  be  sure  where  and  how  he  is  wrong. 


MEANS  OF  INSTRUCTION 

As  Resnick  demonstrates,  even  with  a “good”  task  analysis,  the  mapping  from 
identified  components  to  instructional  strategies  remains  very  much  a matter  of 
artful  trial  and  error. 

Negatively,  I am  concerned  about  the  rigidity  of  instruction  embodied  in 
Atkinson’s  second  and  third  optimization  levels  [pages  93-99] . The  first  level, 
optimizing  the  sequence  within  each  strand,  is  less  of  a problem  just  because 
a smaller  component  of  the  total  program  is  involved.  But  when  all  con- 
trol is  taken  away  from  the  learner,  regardless  of  what  and  when  he  is  eager 
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to  learn,  adaptation  to  individual  interests  and  motivation  becomes  unpossible. 

1 realize  that  the  effectiveness  of  second-level  optimization  is  an  empirical 
question,  but  it  is  not  easily  investigated.  Not  if  we  evaluate  the  effects  o 
instruction  in  long-range  terms  and  look  for  side  effects  along  this  way,  as  Olson 
urged  in  his  comments. 

Atkinson’s  data  on  third-level  optimization  is  extremely  interesting.  In  a 
cu.rei.t  controversy  about  ways  to  increase  equality  of  educational  opportunity. 
Sesame  Street  (which  reaches  everyone)  is  pitted  against  Headstart  (which  is 
limited  to  poor  children).  Atkinson’s  dab  is  the  first  I’ve  seen  that  specifies 
exactly  what  has  to  be  done  to  maximize  the  group  average,  or  minimize  the 
group  variances,  or  increase  the  average  while  not  increasing  the  variance,  s 
research  data  it  is  very  important.  But,  although  1 agree  with  Atkinson  s 
selection  of  the  third  goal  as  his  optimizing  criterion,  the  question  of  the  effect 
on  individual  children  of  the  rigidity  entailed  by  any  attempt  to  optimize  at  this 

level  still  remains.  . 

More  positively,  1 want  to  emphasize  the  importance  of  questions  about  the 

form  of  instruction,  its  timing,  and  the  value  of  practice.  What  are  the  most 
effective  forms  in  which  to  communicate  to  the  learner  what  he  needs  to  know 
to  make  progress?  Some  learning  takes  place  through  the  full  participation  of  the 
learner  from  the  beginnin| , regardless  of  the  immature  forms  that  the  learner 
uses  with  subtle  and  little  understood  forms  of  feedback  producing  successively 
more  mature  approximations  of  expert  behavior.  Oral  language  is  perhaps  the 
best  example  here,  but  it  is  not  the  only  one.  1 have  seen  three-year-old  native 
American  boys  dancing  with  older  siblings,  parents  and  grandparents,  doing  an 
incredible  job  (by  our  strudards  of  age-appropriate  motoric  development)  of 
keeping  in  time.  It  may  re  the  case  that  full  participation  even  with  cluld.sh 
performance  is  characteristic  of  those  learnings  for  which  special  educational 
settings  are  not  required.  But  we  could  try  out  extensions  of  this  model  to  our 
more  formal  educational  objectives.  We  could  let  children  learn  to  write  without 
correcting  their  spelling;  and,  as  Norman  suggested  in  one  discussion,  we  could 
let  children  leam  to  read  without  correcting  ev-'ry  semantic  error  they  m3  e. 

Where  this  more  informed  model  of  learning  does  not  apply,  what  is  the  best 
level  on  which  to  focus  the  learner’s  attention?  In  teaching  a motor  skill  such  as 
square  dance  swinging,  I have  found  it  more  helpful  to  use  the  metaphor  of 
making  your  feet  go  as  if  on  a scooter  than  to  detail  the  component  motions.  In 
teaching  subtraction,  Resnick  suggests  that  it  is  more  effective  not  to  tiy  to 
teach  the  mature  algorithm  directly,  but  rather  to  find  a more  easily  instructable 
process  from  which  the  children  can  discover  the  rest  themselves.  Clearly,  more 
research  is  needed  on  effective  forms  of  instruction  for  varied  educational 

the  optimal  timing  for  confronting  the  learner  with  the  discrepant 
information  he  needs  to  move  on?  In  the  kind  of  informal  learning  that  proceeds 
from  full  participation  in  ongoing  experience,  that  discrepant  information  is 


18.  IMPLICATIONS  FOR  INSTRUCTIONAL  RESEARCH  323 


always  available,  and  the  child  makes  selective  use  of  it  in  his  own  time.  But  in 
more  formal  educational  settings,  out  of  the  contexts  where  experts  are  perform- 
ing alongside  learners,  we  face  timing  decisions  that  axe  difficult  indeed. 

Finally,  what  about  the  value  of  practice?  It  was  only  discussed  informally 
here.  Yet  practice  is  one  variable  whose  role  may  vary  significantly  between 
short-term  experimental  and  long-term  educational  types  of  instruction.  Carey 
(1974)  suggests  that  the  practice  of  constituent  skills  necessary  for  their  incorpo- 
ration into  more  complex  structures-“modularization  in  Bruner’s  term-applies 
to  cognitive  achievements  as  well  as  to  motor  skills.  Children  engage  in  self- 
generated practice,  (see  Cazden,  1972,  pp.  93-97;  Cazden,  1974,  for  language 
examples).  Does  the  effect  of  practice  on  learning  vary  depending  on  whether  it 
is  so  self-generated  or  designed  and  imposed  by  someone  else?  If  practice  is  the 
label  we  give  to  seemingly  valuable  behavior,  than  “bad  habit”  is  the  comparable 
label  for  repetitions  of  behavior  that  leave  something  to  be  desired.  Immature 
language  forms,  such  as  those  given  above,  are  easily  replaced  as  development 
proceeds,  no  matter  how  often  repeated  they  may  have  been.  Can  we  specify 
more  generally  the  categories  of  behavior  to  which  the  notions  of  practice  and 
bad  habit  do  and  do  not  apply?  These  questions  are  admittedly  imprecisely 
formulated.  Because  of  their  importance  for  education,  I hope  they  can  be 
clarified  from  the  perspective  of  this  volume. 
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What  do  we  know  about  what  happens  to  a learner  during  instruction?  One  of 
the  most  exacting  criteria  for  testing  our  knowledge  about  any  phenomenon  is 
the  extent  to  which  we  can  build  a model  that  exhibits  the  behavior  being 
studied.  If  we  can  simulate  it,  then  we  have  at  least  a sufficiency  model.  Of 
course,  there  may  be  many  aspects  of  the  model  that  lack  plausibility , but  they 
can  then  become  the  focus  of  further  study.  (Reitman,  1967,  once  characterized 
this  simulation  approach  to  cognitive  psychology  as  a way  to  “invent  what  you 
need  to  know.”)  In  this  chapter  I will  raise  some  questions  about  how  one  might 
go  about  building  a model  of  a learner  in  an  instructional  mode  (MOLIM). 


LEARNING  AS  PROBLEM  SOLVING 

To  learn  is  to  solve  a problem.  In  all  but  the  most  elementary  situations,  learning 
is  under  the  learner’s  strategic  control  of  attention  and  memory.  If  this  view  of 
learning  is  valid,  then  the  study  of  complex  problem  solving-and  the  orientation 
such  study  provides  to  cognitive  psychology-has  direct  relevance  for  the  design 
of  a MOLIM.  In  this  section  I will  mention  a few  features  of  problem-solving 
theory  that  seem  to  justify  the  view  of  learning  as  problem  solving,  and  that  also 
have  particular  importance  for  the  study  of  learning.  Then,  in  the  next  section  I 
will  raise  some  questions  about  the  design  of  a MOLIM. 

Current  information  processing  approaches  to  the  study  of  human  problem 
solving  proceed  by  postulating  a general  system  architecture,  and  then  construct- 
ing explicit  representations  for  the  data  structures  and  the  processes  that 
generate  the  observed  problem  solving  behavior.  A problem  solution  consists  of 
an  internal  representation  for  some  knowledge  that  the  system  did  not  have  at 
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the  outset.  Problem  solving  consists  of  a series  of  local  transformations  of 
knowledge  that  ultimately  reach  the  desired  knowledge  str.te.  In  several  of  the 
chapters  in  this  volume  (e.g.,  the  Greeno’s  Chapter  7),  the  “solution”  to  the 
learning  problem  is  explicitly  represented  as  a data  structure  (semantic  net)  and 
a set  of  procedures  for  searching  that  network.  But  note  that  these  results-these 
solutions  to  the  learning  problem-are  static  with  respect  to  the  learning  process 
itself.  That  is,  with  respect  to  the  time  grain  of  the  instructional  process,  the 
results  of  instruction,  even  though  they  may  themselves  by  dynamic  processes, 
are  structures  upon  with  the  learning  system  must  operate.  We  need  a model  of 
the  system’s  response  to  instruction,  that  is,  its  functioning  in  circumstances  in 
which  it  must  attend  to  the  instructional  episode  and  modify  its  own  per- 
formance structures  and  processes. 

In  our  instructional  efforts,  we  try  to  provide  optimal  environments  for  the 
human  information  processing  system  to  learn  something.  As  it  is  with  the  horse 
led  to  water,  so  it  is  with  the  learner  in  an  instructional  situation:  we  can’t  make 
it  ingest  what  we  offer.  The  instructional  design  question  is  typically  "Will  the 
learner  learn  from  this  instruction?”  A further  question  should  be  "Why  should 
he  learn?”  The  viev/  of  learning  as  problem  solving  suggests  some  ways  to 
characterize  this  question.  Problem-solving  theory  (Newell  & Simon,  1972) 
includes  two  features  of  importance  for  our  purpose.  One  is  a detailed  internal 
representation  of  the  task  environment.  The  other  is  a characterization  of  how 
the  human  information  processor  allocates  its  limited  processing  capacity  to  the 
problem-solving  process.  A principal  method  for  effecting  this  allocation  is  the 
use  of  explicit  representations  for  goals.  Goals  are  symbolic  expressions  that 
direct  and  control  the  course  of  problem  solving,  representing  what  the  system 
“wants”  to  do  at  any  moment,  and  “why”  it  wants  to  do  it.  Thus,  the  answer  to 
whether  or  why  the  system  will  learn  becomes,  in  view  of  learning  as  problem 
solving,  a matter  of  stating  the  circumstances  under  which  learning-related  goals 
are  generated  and  manipulated. 

SOME  DESIGN  QUESTIONS  FOR  A MOLIM 

In  this  section,  I will  raise  four  questions  that  must  be  answered  by  the  designer 
of  a MOLIM: 

1.  When  should  learning  occur? 

2.  How  will  the  system  be  changed  as  a result  of  learning? 

3.  How  thoroughly  assimilated  is  the  thing  to  be  learned? 

4.  How  distinct  is  learning  from  performance? 

These  questions,  and  their  answers,  are  highly  interrelated,  and  it  is  difficult  to 
determine  their  appropriate  order  of  presentation.  Their  ordering  here  is  arbi- 
trary, and  does  not  imply  any  particular  differential  importance  in  my  mind. 


19.  DESIGNING  A LEARNER  327 


1,  When  should  learning  occur?  A curious  problem  with  most  of  the  learning 
models  in  both  cognitive  psychology  and  artificial  intellignece  is  that  they  are 
too  single-minded  in  their  task:  they  learn  all  the  time.  In  designing  a plausible 
MOLiM,  we  must  be  able  to  account  for  the  fact  that  most  of  the  time  learning 
does  not  occur.  We  can  do  this  by  explicitly  including  in  our  MOLIM  the  precise 
conditions  under  which  an  instructional  episode  causes  something  to  be  learned. 
This  is  where  the  appropriate  use  of  goals  could  play  a role.  Rather  than 
construct  a system  in  which  the  tendency  to  learn  is  integrally  built  in  to  the 
underlying  operating  mechanisms,  we  can  design  a more  general  problem  solver 
whose  problem  is  to  learn,  and  whose  goals  include  explicit  learning  efforts. 

By  themselves,  such  goals  would  still  be  inadequate  for  deciding  when  the 
system  should  bam.  Additional  information  would  be  required  about  the 
current  stati  of  knowledge— that  is,  about  both  the  current  configuration  of  the 
external  environment  as  well  as  the  internal  state  of  the  system  Thus,  another 
H*«ign  decision  concerns  those  variables  and  their  critical  ranges  which  would,  in 
conjunction  with  the  learning  goals,  activate  the  self-modification  processes. 

The  mechanisms  that  determine  when  learning  is  to  occur  must  be  capable  of 
representing  differential  responsiveness  to  instruction.  As  Resnick  (Chapter  3, 
this  volume)  has  pointed  out,  our  models  must  be  able  to  represent  both  early 
and  late  forms  of  task  proficiency,  and  for  a MOLIM,  the  task  is  learning  itself. 
Therefore  a MOLIM  must  incorporate  the  capacity  to  represent  both  early  and 
late  learning  proficiencies.  Siegler  (1975)  has  noted  the  importance  of  experi- 
mental designs  in  which  both  older  and  younger  children  are  given  the  same 
training  sequences,  in  order  to  examine  the  posable  interaction  of  age  and 
instructional  effects.  Since  such  interactions  have  been  found  (e.g.,  Siegler  & 
Liebert,  1975),  we  must  be  able  to  represent  them  in  MOLIM,  through  the 
general  strategy,  suggested  by  Resnick,  of  building  developmentally  tractable 
models. 

2 How  will  the  system  be  changed  as  a result  of  learning?  This  is,  perhaps,  a 
more  useful  way  to  say  “what  is  learned?”  There  are  several  outcomes  that  can 
result  from  the  learning  effort.  One  result  is  that  nothing  happens:  the  learning 
attempt  fails,  and  no  lasting  change  is  made  in  the  system  As  noted  above,  this 
is  more  the  rule  than  the  exception  in  real  instructional  situations,  and  we  must 
be  able  to  build  a system  that  can  handle  this  fact. 

Another  possible  result  is  that  the  entire  system  architecture  could  change. 
That  is,  the  system’s  components  and  their  interrelationships  might  be  altered. 
However,  since  by  “system  architecture”  I mean  “hardware”  rather  than  the 
“software”  of  the  human  information  processor,  it  seems  unlikely  that  this  kind 
of  change  is  really  the  result  of  instruction.  Although  it  would  be  required  in  full 
developmental  theory,  we  need  not  be  too  concerned  with  it  for  now. 

There  are  two  kinds  of  software  changes  that  the  system  can  undergo:  changes 
in  processes  and  changes  in  structures.  Newell  (1972a)  has  demonstrated  the 
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imprecise  nature  of  the  process-structure  distinction  in  systems  that  are  them- 
selves undergoing  change.  In  the  case  of  a self  modifying  system,  the  ambiguity 
becomes  even  greater,  since  it  is  linked  to  the  issue  of  degree  of  assimilation  to 
be  described  below.  Although  one  often  makes  an  apparently  unambiguous 
distinction  between  instruction  directed  to  acquisition  of  facts,  and  instruction 
directed  to  the  teaching  of  procedures  (or  skills),  it  is  clear  from  the  work  of 
Greeno,  Hyman,  and  Norman  (Chapters  7,  8,  and  9 of  this  volume)  that  the 
issues  are  not  so  simple.  One  can  represent  factual  knowledge  by  procedures  that 
can  generate  those  facts,  and,  conversely,  one  can  represent  what  could  be 
procedural  outputs  by  appropriately  complex  static  symbolic  networks. 

Another  type  of  change  that  may  result  from  instruction,  and  which  we  must 
therefore  be  prepared  to  explicitly  represent  in  our  MOLIM,  is  change  in  the 
learning  properties  of  the  system  beyond  the  representation  of  the  specific 
instructional  material.  For  example,  in  the  case  of  the  aggregate  models  with 
which  Atkinson  represents  the  learner,  there  are  a few  changes  in  the  acquisition 
parameters  that  result  from  the  instruction.  In  more  complex  models  of  learning, 
such  systemic  modifications  would  include  the  basic  rules  of  self-modification 


3.  How  thoroughly  assimilated  is  the  thing  to  be  learned?  In  the  paper  cited 
above,  Newell  (1972)  distinguishes  between  several  levels  of  general  versus 
specific  knowledge  about  a task.  The  more  general  the  knowledge,  the  more 
transformational  rules  are  necessary  to  take  the  system  from  its  entry  state  at 
performance  tune  to  a task-specific  state  in  which  it  can  actually  perform  the 
task  at  hand.  Conversly,  a very  task-specific  piece  of  knowledge  might  be 
represented  in  “machine  code”:  being  fully  assimilated  it  would  require  no 
interpretation  at  run  time,  however  it  would  be  of  limited  generality. 

A concrete  example  of  this  distinction  is  provided  by  the  models  for  children’s 
performance  on  seriation  tasks  developed  by  Baylor  end  Gascon  (1974).  In  these 
models  there  are  two  kinds  of  representations  for  “seriation  knowledge.”  One  is 
a base  strategy,  consisting  only  of  series  of  nested  goals,  that  describe,  at  the 
most  general  level  a strategy  for  seriation  (e.g.,  “find  max,”  or  “insertion”).  The 
other  representation  is  a rule  set  that  accounts  for  each  meve  made  by  the  child 
u ring  a specific  seriation  task.  Behavior  during  length  seriation  has  one  rule  set 
and  behavior  during  weight  seriation  has  another.  If  the  system  has  only  the  base’ 
strategy,  then  it  also  requires  a set  of  rules  that  take  the  base  strategy  and 
construct  a task-specific  variant  (e.g.,  for  weight  seriation).  There  are  various 
ways  to  conceptualize  this  mapping.  The  two  simplist  are  a complete  “compila- 
uon,  in  which  the  base  strategy,  plus  the  task-specific  mapping  rules,  create  an 
entire  task-specific  system  that  then  runs  on  the  task.  The  other  is  a collection  of 
interpretive  rules  that  never  create  a task-specific  entity,  but  instead  interpret 
the  base  strategy,  “on  the  run,”  in  terms  of  the  specific  task. 

In  designing  a MOUM,  we  must  decide  upon  the  assimilatedness  of  the 
information  to  be  acquired.  The  sematic  networks  of  Norman  and  Greeno  in  this 
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volume  appear  to  be  far  toward  the  task-specific  end  of  the  spectrum,  while 
Shaw  and  Wilson  appear  to  be  focusing  upon  a more  general  “base  strategy”  in 
their  representations  of  group  generators.  A similar  contrast  can  be  found  in 
comparing  Atkinson,  (Chapter  4)  with  Resnick  (Chapter  3).  Atkinson  is  aiming 
at  an  instructional  procedure  that  will  create  a very  specific  set  of  data  structures 
and  processes  that  will  enable  the  learner  to  write  programs  or  to  acquire  a set  of 
reading  patterns.  Resnick  has  begun  to  investigate  the  manner  in  which  the 
learner  abandons  the  task-specific  instructions  and  creates  a more  efficient  and 
general  procedure.  My  strategic  bet  is  that  by  representing  the  result  of  learning 
as  “base  plus  interpreter,”  we  may  begin  to  get  a handle  on  the  mechanisms  c ! 
generalization  from,  or  beyond,  the  specific  instructional  sequence. 

4.  How  distinct  is  the  learning  system  from  the  performance  system ? In 
almost  all  models  of  learning,  be  they  psychological  modek  or  examples  of 
Artificial  Intelligence,  there  is  a clear  distinction  between  the  learning  processes 
and  the  thing  to  be  learned,  that  is,  the  performance  system  (see  for  example  Jie 
models  in  Feigenbaum  & Feldman,  1963,  or  Simon  & Siklossy,  1972).  The 
distinctions  are  made  with  respect  to  the  over-all  organization  of  the  respective 
systems,  the  underlying  representations,  and  even  the  basic  system  architecture. 
For  example,  in  the  letter  series  completion  model  of  Simon  and  Kotovksy 
(1963),  much  attention  is  paid  to  the  differential  short-term  memory  demands 
made  by  different  representations  for  different  serial  concepts,  but  the  demands 
made  during  the  induction  of  these  concepts  (i.e.,  during  their  learning)  are  not 
directly  addressed.  Another  example  of  this  distinction  can  be  found  in  Water- 
man's (1970)  learning  program  in  which  the  result  of  training  was  represented  as 
a production  system,  but  the  learning  system  itself  was  not production  system. 

Although  such  separation  has  the  benefit  of  making  the  modeling  task  more 
manageable,  it  lacks  both  elegance  and  psychological  plausibility.  1 would  hazard 
the  guess  that  the  same  mechanisms  that  span  the  gap  between  general  base 
strategy  and  the  task-specific  system  (sec  Question  3,  above)  are  implicated  in 
the  learning  process  itself.  The  more  homogeneously  we  design  the  MOLIM,  the 
more  likely  we  are  to  be  able  to  solve  both  problems  simultaneously.  Such  a 
view  might  be  nothing  more  than  idle  speculation  were  it  not  for  the  recent 
work  of  my  colleague,  Don  Waterman.  He  has  constructed  a set  of  adaptive 
production  systems  for  a range  of  learning  tasks  (Waterman,  1974a  b).  These 
models  learn  simple  addition,  verbal  associations,  and  complex  letter  series.  Each 
model  is  written  as  in  initial  cere  of  productions,  some  of  which  have  the 
capacity  to  add  additional  productions  to  the  initial  core.  The  final  “learned” 
system  operates  under  the  same  control  structure  and  system  architecture  as  the 
initial  syi  em  and  the  learning  rules  are  represented  in  precisely  the  same  way  as 
the  new  rules  that  are  learned,  that  is,  as  productions. 

The  instructional  environments  in  which  l’  .i'erman’s  system  do  their  self- 
modification are  relatively  simple,  but  1 beli  ve  That  the  basic  approach  is  very 
sound,  and  extendable  to  ticher  instructional  pioblems.  !n  a less  precise  but 
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much  more  general  statement,  Wallace  and  I (Klahr  & Wallace,  1976)  hav< 
°f  * — *■-«&! 


CONCLUSION 

^estionPoMhin  T VOlUT  repre“n‘  diVCrSe  bu‘  conver8inE  answers  to  the 
question  of  the  relevance  of  some  current  research  in  cognitive  psychology  to 

struct lonal  design.  I have  not  attempted  to  synthesize,  evaluate,  or  review  the 

previous  chapters^ because  several  efforts  have  already  been  made  in  the  dlus- 

, on  chapters  by  oregg,  Olson,  Famham-Diggory,  Hayes,  Collins,  Shaw,  Glaser, 

heln^  H ’ 7 in,en‘  b**  been  ,0  pr0vide  “ orientation  that  might 
help  the  reader  to  form  his  own  evaluation  of  the  research  reported  here 

By  confining  attention  to  learning  in  intentionally  instructional  situations  I 

"oTocat,LTP,ev  ,0  MUCe  ^ ,aSk  ,0  man^Cable  ProPortions.  Instead  ’of 
liOncocting  a general  learrung  system,  l have  considered  the  design  of  more 

m ted  models  of  the  effects  of  specific  instructional  situations.  Such  models 
will  in,  .aJy  tend  to  be  largely  determined  by  the  task  environment,  that  is,  by 
the  mater, <d  and  its  form  of  presentation.  However,  there  are  some  fundamental 
questions  that  are  worth  asking,  questions  which  may  apply  to  a wide  range  of 
instruction,  even  though  their  answers  may  be  task-specific. 

Having  posed  the  design  issues,  we  might  ask  a few  questions  about  the 

“;rBt’.ptr  “•  my  bo,her  ^ such  an  effort?  There  seem  to  be  a few  good 
reasons.  First,  if  we  could  actually  build  a sensible  model,  we  could  directly 
simulate  the  results  of  proposed  instructional  procedures.  The  potential  value  of 

piant,.  is  they  couid  replace  i 

in  the  ! Vari8ti0nS  tha'  WC  316  PfeSCntly  forced  to  use'  But  except 

lol  Tjn  ”MT’  **  “ ab"  •»  MM  -d.  models,  and  l 

worth  of  the  enterprise  lies  in  its  propaedeutic  nature:  it  may  give  us  an 

introduction  ,0  the  kinds  of  things  we  still  need  to  know  Several  of  , Z 
3T|n'S  by  0ther  ^scussants  in  this  volume  have  raised  the  disquieting 
possibility  that  we  may  have  little  here  that  is  really  new  or  useful.  I tfonk  that 

bsL3,  V,CW  ‘S  !!nJUStified’  bUt  11,6  iSSUC  Cannot  real‘y  he  addressed  in  The 
abstract,  we  need  concrete  examples  of  what  we  are  talking  about.  Thus  the 

second  reason  for  attempting  to  raise  some  design  issues  is  that  the  exercire  of 

con^TxL'pT1''  "™in8  f'0m  mnK'l°"  W'"  V™*  “ 

Jh.t  .e  can  ask  ,bou,  ,he  *.%1  of  , M0L|M  „ 
o might  benefit  from  such  an  exercise?  It  seems  to  me  to  be 

conceived)  'coulrbe'113,'^6'^'^'’0'5  " !eam'rS  (at  least  as  traditionally 

ZTlL  ™v.r^  ? ” ,hlnkl,,g  abou'  ,ht  d«W>  »f  learning 

models.  The  payoff,  al  present,  appears  .o  be  fo,  die  people  who  fall  into  the 
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intersection  of  the  categories  of  instructional  designer  and  cognitive  psychol- 
ogists. The  contributors  to  this  volume  were  selected  because  of  just  such  a 
blend  of  interests  and  skills.  Their  answers  to  some  of  the  questions  I have  raised 
are  implicit  in  the  work  they  have  presented  in  previous  chapters.  Perhaps  other 
“learning  engineers”  can,  in  reaching  their  own  answers,  begin  to  apply  and 
direct  the  kinds  of  basic  research  that  are  required  tc  further  our  knowledge  of 
both  cognition  and  instruction. 
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and  task  analysis,  1 1 3-1 14 
Independence  of  cognitive  processes,  23-48, 
111 

sources  of  dependency,  35 
Information,  role  in  language  acquisition,  1 7 
Information  processing 
definition,  64-65 
and  task  analysis,  63-80 
Information  ne'work,  of  BIP,  85-89 
Initial  states  of  knowledge 
description  and  diagnosis,  308,  3 1 2 
see  also  Expert-novice  distinction, 
Knowledge 

Insight,  58,  59,  319,  see  also  Meanin  [fulness 


Instmclability,  52,  62,  307 
Instruction 

cognitive  theory  of,  117-120 
in  cognitive  skills,  238 
definitions,  3 -5,  SI,  318 
in  difficult  contexts,  291-299 
in  foreign  language,  4,  7 
role  of  generative  concepts,  217-220 
theory  versus  design,  1 18 
see  also  Adaptive  instruction.  Tutorial 
method 

Instructional  design  and  cognitive 
psychology,  303-316 
Instructional  examples  of  negation, 
260-262 

lnsiructional  goals,  123-159 
Instructional  growth,  223-234 
Instructional  implementation,  310 
Instructional  means,  120,  321-323 
Instructional  objectives,  123-159 
Instructional  relevance,  52,  61 
Instructional  sequence,  224-225 
Instructions 

containing  implicit  negatives,  255-262 
comprehension  in  everyday  situations, 
268 

oral  versus  written,  270,  274-275,  322 
simplifying  and  clarifying,  283-284 
see  also  Comprehension,  Understanding 
Internal  components  of  verbs,  178-183 
Invariance,  2 1 0-2 1 7 
Invention.  76 
Isomorphism,  271, 282 


J 

The  jure p problem,  190-192 


K 

Knowledge 

acquisition,  238,  245,  313-314 
of  abstract  concepts,  197-221 
during  impression  formation,  161-176 
underlying  cognitive  capacity,  1T7-221 
diagnosis  of  initial  states,  308,  312 
event  concepts,  2JO-217 
generative  linguistic,  201-203 
objective,  1 1 8 

origins  versus  representations,  298 
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Knowledge  (contd  ) 
partial 

in  language  acquisition,  1 1 
in  talk  analysis,  S3,  62-63,  73 
prior 

in  language  acquisition,  1 1 
of  results,  275 
in  task  analysis,  55-58 
relation  to  performance,  119-120 
for  solving  problems.  '36-149 
structure,  321,  325 
transmission,  1 1 9 
and  understanding.  277-282 

L 

Language  acquisition.  4-21 
behavioral  theory  of.  12-15 
by  chimpanzees.  14.  1 9 
cognitive  theory  of,  15-18 
by  exposure,  4 
of  foreign  language.  4,  7 
CA1  program.  99-106 
generative  knowledge,  201-203 
by  imitation.  20 
individual  differences  in,  8.  22 
by  internalization.  4 
models  of,  8-21 
naive  theory  of,  9-12 
of  native  language.  4 
representation  for  verbs  of  po session 
180-183 

see  alto  Competence  and  performance 
Language  acquisition  devide  (LAD),  4,  I 9 
Language  learning,  see  Language  acquit  ion 
Langi’ige  skills,  4-22 

defines.'.  4 

in  foreign  language.  4,  7 
in  native  language.  4 
models  of  learning,  5-21 
see  also  Language  acquisition 

Learner-controlled  instruction,  106  318 

321 

Learning 

cognitive  theories,  1 1 0-1 1 1 
definition,  319 

as  distinct  from  performance,  329 
facilitating  conditions.  309-310 
how  will  it  change  the  learner.  327 
by  modification  of  schemata,  184-190 
as  problem  solving,  325-326 
of  serial  patterns,  1 1 5 


^earning  {contd. ) 
strategies  in,  114-115 
when  should  it  occur,  327 
Learning  hierarchies,  55-58 
Learning  to  learn,  78-80  194,  287 
Linking  science  between  psychology  and 
instruction,  303-305 

M 

Matching  to  determine  isomorphism,  282 
Mathematics 

arithmetic,  task  analysis,  54-58.  65-70 
fractions,  125-135 
generative  concepts,  200-201 
geometry,  task  analysis,  58-59,  35-150 
task  analysis,  51-80 
Maximal  rate  contours,  90 

Mayonnaise  problem  185-186 

Meaningfulncss.  145-149.  319 
Measurement.  76,  see  also  Mathematics 
Meddling,  I 1 9 
Memory 
external,  265 
recognition.  161-176 
reconstructive  aspects,  174 
representation  in.  1 77-194 
restructuring  c in  impression  formation 
173  176 

see  also  Representation.  Semantic 
memory  , STM 
Mental  computation.  1 27 
Mental  processes  in  sentence 
comprehension,  245-268 
Metaphor,  281-282.  322 
Modeling.  1 1 9 

MOLIM  (model  of  a learner  in  an 
instructional  mode),  325-331 
Monster  problems,  272-275,  278-279 
288-239.  297 

Motivation  in  language  acquisition.  9,  16 
Motor  skill,  322 
Move  iterator,  2/3-274 
Meddling,  1 19 

N 

Negatives 

applied  to  quantifiers.  255 
denial,  251-252 
implicit,  255-260 
relative  difficulty,  267 
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Negation 

tepresenMion  of,  246 
tcope  25 1 

in  senienc..  completion  263-2/4 
time  to  process.  247 
Nominal  oncepts.  tec  Concepts 
Novice-expett  distinctions,  ter 
Expert -novice  distinctions 

O 

Optimal  control  theory.  91 
Optimization  of  learning,  81-107.  31 2. 
321-322 

for  class  performance,  96-98 
in  German  vocabulary  learning,  HX3-106 
for  individual  students.  93-96 
Optimal  sequencing,  tec  Optimization  of 
learning 

Oral  communication.  270,  322 
P 

Pattern  recognition,  1 37-1  39,  144-145 
Perceiving  aflordancr  structure,  212-217 
Person  perception.  161-162.  238 
PLANNER.  142-144 
Planning.  236 
Practice.  323 

in  language  acquisition.  11,17.  99-102 
as  means  of  strengthening  S— R bonds.  54 
Predication.  246 
Praxis.  1 19 

Problem  solving.  319.  326 
course  in.  239-242 
evaluation  of  effectiveness,  239  -242 
knowledge  icquircd.  136-  149 
well  structured  versus  ill-structurcd. 
277-278.  280,  288-  289,  317 
Problem-solving  skills.  88 
Problem  space.  271 

Process-structure  distinction  327  328 
Production  systems.  22.  142,  144  329 
for  (ctformance  grammars,  7 
Productive  thinkin-.  58-61 
Professionals.  iec  expert  peifotmancc 
Propositional  networks.  1 39-144.  tee  alto 
Relational  networks 

Propositional  representation  of  sentences. 
246 

Prototypes.  203.  228.  See  alto  Stereotypes 


Psycholinguistics,  3-22.  245-268,  321 
Psychology  < instruction,  305-31 1 
Puzzles,  27e,  280 

Q 

Question  answering.  153-158 
R 

Rational  task  analysis,  tee  Task  analysis 
Reading 

computer  assisted  instruction  in,  89  -99 
ti'ts  of  individual  differences,  31-33 
tests  ot  process  independen  t.  30-31 
Reading  comprehension,  see  .'cm prehen- 
sion, Verbal  comprehension  res  is 
Reading  comprehension  tests,  265-266 
Recognition  of  relations.  1 37-1  39 
Reinforcement.  >09 
■n  language  ac qu. ittton.  1 . 13.  1 7 
Relational  networks.  145-149.  tee  a/so 
Propositional  networks.  Semantic 
memory 

Relevance  of  umculum.  31  7 
Representation.  272-273.  278.  3H 
of  concepts  as  procedures  1 25-1  35 
description  of.  in  UNDERSTAND,  274 
of  fractional  quantity.  126-1  35 
of  information  extracrcd  from  leniences. 

246  251 

vu  schemata  in  memory  . 177-194 
of  thematic  information.  265-266 
for  verbs.  1 78  , 83 
Rewards,  tee  Rcir/orcemenl 
Rote  learning.  177 
Rote  proofs.  146 

Russun.  computer  assisted  instruction  in. 
!(X) 

S 

Schemata.  161-176,  203,  230.  238 
ard  memory  representation,  177-194 
modification  of.  184 
selection.  184 
structure.  183-184 
Science  of  design.  304-305 
Scope  of  a negative,  rev  Negation 
Second-language  learning,  tee  language 
acquisition 
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Self-modification,  326,  328-329 
Semantic  knowledge,  278-280 
Semantic  memory,  111,  | 25,  272 
Semantic  network*.  150-153,  178-182 
229,319,  326,  328 
Semantic*  in  algebra  word  problems 
280-28! 

Sentence  completion  tasks,  262-265 
Sentence-picture  companion,  247-249 
287-288 

Sentence  representation 
affirmative*  and  negatives,  246 
embedded  clause*,  251-252 
Sentence  verification,  246-255 
Sequential  processes,  1 1 2 
Serial  pattern  learning,  1 15 
Set  theoretic  representation  of  fractional 
quantity,  1 30-1  35 
Similarity  to  stereotype,  164-174 
Simulation,  325 
Skill,  5-8 

Spatial  representation  of  fractional 
quantity,  126-135 
Stage*  of  competence.  5 3 
Stereotypes 

assimilation  to.  I 70,  229 
as  schema U in  memory,  161-176 
Sternberg's  additive-factor  paradigm 
description,  24-26 
generalisation  of,  20-36 
Stimulative  therapy  in  aphasia,  294-295 
STM  (Short-Term  Memory),  274-275,  329 
Strategics,  j ’8-329 
in  learning,  114-115 
in  understanding.  281-282.  285 
Structural  networks,  see  Acti.e  structural 
networks.  Semantic  networks 
Styles  of  interpretation,  285 
Subject  mallei 
addition,  71-72 
algebra,  280 

arithmetic.  54-58,  65-72 
art  appreciation,  298  299 
calculus,  236 

computer  programming,  82-89,  190-192 
285 

cooking.  185-190 
foreign  languages  3-8,  99-106 
fractions,  125-1  35,  237 
geography,  320 
geometry,  58-59,  1 35-150 
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German,  99-106 
problem  solving,  238-242 
psychophysics,  1 50—1 5 7 
reading,  30-34,  89-99 
riviting,  224-228 
Russian,  100 

second  language  vocabulary,  99-106 
subtraction,  69-71 
writing,  283 

Subtiaction  problems,  69-71 , see  also 
mathematics 

Syntactic  analysis  of  instructions,  271 
T 

Task  analysis,  31 1-312,  319-321 
definition,  51-53 
in  additive  factor  paradigm,  24-25 
empirical  analysis,  68-72 
Gangne’s  conlribulion,  55-58 
history,  53-63 

and  individualization,  113-114 
information  processing  approaches, 
63-80.  125-159 

in  instructional  design,  5 1 -80,  109-1 1 
in  mathematics  51-80.  125-149 
Piaget’s  contribution,  59-61 
rational  task  analysis,  65-68 
instructional  relevance,  52-53,  61 
satire  of,  321 

and  subject  strategies.  1 14-115 
Thorndike's  contribution,  53-55 
Wertheimer's  contribution,  58-59 
Task  sequences,  55-58 
Teachability,  see  Instructability 
Teaching 

routines  for,  72-78 
of  understanding,  284-285 
see  also  Instruction 
l esting,  see  PvaJ-ation 
Time  spans  in  attaining  competence,  31  3 
1 ransformations,  perception  of 
shape  of  nonrigid  objects,  212-21  3 
shape  of  facer,  213-217 
tea-making  event,  217 
Transition  matrix,  103 
Tutorial  dialogues,  187-190,  320 
Tutorial  method 

in  computer  assisted  in-truction,  87  -89 
Socratic,  227 
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u 

UNDERSTAND  computer  program, 
269-284 

Understanding,  58-59,  118-119,  269-285, 
287-289,  319 
criterion  of,  271 
different  styles  of,  281-282 
nature  of,  270-277 
puzzles,  278-280 
theory  of,  271-274 


Understanding  task  instructions,  269-285, 
317 

V 

Verbal  comprehension  tests,  247,  263-265 
W 

The  whitc-s  iuce  problem,  186-190 


